Blogs.

January 28, 2010 by Aitor Macia
A blog is a regularly updated Web site that collects chronological texts or articles from one or more authors, the most recent one appearing first, where the author always retains the freedom of creating everything he wants.
This blog or weblog English word comes from the words ‘web’ and ‘log’, meaning the last one diary. Normally, every blog comes to show us the personality of its autor, because of the freedom he has in order to create and write everything he thinks is interesting.
It is possible to establish a dialogue in a blog because its readers can write comments talking about it, and the autor can also answer to that comments he receives. But it is the autor that chooses who can make a comment about his article and who cannot do it, existing tools which restrict the access to some people.
There are many kind of blogs: journalistic, personal, politic blogs….. and they are each day more common and popular, being nowadays more than 100 million of blogs. They use to combine text, different images and many link to other pages or sites.  And it is thought that this kind of pages will be more important in the future than they are now.
 
References:
 

CiteUlike and Google Books.

January 28, 2010 by Aitor Macia

Let’s talk about two of the most useful and interesting services which we can find in the Web. I am referring to CiteUlike and as well Google Books.

  • CiteUlike: it is a famous free online service that we can use in order to organize or arrange many academic publications. It was created and put on the web in October 2004, at the same time that its creator, Richard Cameron, was attached to the University of Manchester. It is designed concretely for the needs of scientifics and also scholars. It is a free social marker that helps anyone to store, organize, share and recommend those scientific articles you are reading. Site CiteUlike extracts automatically the details of the bibliographic reference that you have recently attached.
  • Google Books: it is a service that allows people search and find information about any book they want by internet. In the 2003 Google began scanning thousands of works whether copyrighted or not for easy access to readers. By the year 2009 Google had already scanned over ten millions of books. We can also find some entire books if we select the option of  ‘entire books’ on the advanced search.

 

References:

Markup language.

January 28, 2010 by Aitor Macia

Processing, definition and presentation of texts are the main goals and the reason why  the markup language was designed. A markup language is a form of codifying a document that, together with the text, incorporates labels or marks that contain more information about the structure of the text or his presentation. The more widespread of them is the HTML, foundation of the World Wide Web.

We can often differentiate between three types of markup languages, although we can combine several classes in the same document. For example, the HTML tag contains purely procedural, together with others purely descriptive. The HTML also includes the element PRE, which says that the text must be represented at the same way that is written.

Let’s talk about the three types of markup languages used in HTML:

  • The presentational: this type allows us to see the format of the text. That way we can make up the presentaion of certain documents for its lecture. It is not interested in the functions of the text, only in its appearance.
  • The procedural one: it is focused on the presentation of the text, even if it is also visible for that who edits the text. Some of its examples are:  troff, TeX and PostScript.
  • The descriptive one: it uses tags in order to describe the fragments of the text, but it doesn´t specifie how they should be represented. The languages expressly designed to generate marking descriptive are the SGML and XML.

Any computer file can deliver HTML documents, being the most common ones the web sites or emails.

References:

Digital libraries

October 25, 2009 by Aitor Macia

A digital or virtual library is a library in which a significant proportion of the information resources are available in the digital format, accessible through the computers, and whose  essential exponent is the Internet. This type of library allows the user to find the documents when he/she needs them, and for doing this, it responds dynamically from its network of information sources.

One of the most important ones is The Darlington Digital Library, set in the University os Pitsburgh. As it´s said in the library´s own page:  ”The Darlington Digital Library was created from the first major collection of books, manuscripts, atlases, and maps donated to the University of Pittsburgh”. This spectacular library was created by Mr. Drlington, who started collecting maps, manuscripts, works of art etc. in the year 1840. All these acquisitions were digitalized in the summer of 2006.

There are some other very important digital libraries, like the Furness Shakespeare Collection (University of Pensylvania), which I should mention at least.

References:

Main differences between the ebook and the paper book.

October 8, 2009 by Aitor Macia

Writing a text which faces the electronic book and the paper book doesn´t mean to put one of them above the other one. Doing this we only want to see the main differences between them. We must say that the electronic book is not going to abolish the paper book, at least,  by now. Remember that the printed book has had a main function in the growth of any language, and also in their never-ending.

Even if they have some similarities, this two type of books have many other differences between them. The ebook can be easily transferred from one place to another without being forced to do it physically. For doing this you must be surfing the Internet, but that fact is really easy to carry on nowadays. An electronic book may contain more information and the possibility of accessing to any external data, while the printed book works under the equation “to grater content, the greater number of paper and weight”. Ebooks have also the advantage of being easier to distribute through Internet, and that fact makes them cheaper, and sometimes free, which means that its autor has the opportunity  of being read by many surfers. Electronic book’s last advantage is that it allows us to see videos, imagine, sound etc. Finally we should say something in favour to printed books. When the reader wants to take some note of what he has read it is much more difficult to do it with an ebook, even if it is also available.

In short, both types of book have some advantages and also disadvantages, but it would be great for us the endurance of them all.

 

References.

The author and authorship in the digital world.

September 27, 2009 by Aitor Macia

Since the advent of the printing press untill today, the author´s concept has moved to set the idea of what we now understand as such. The process of setting the text, considering the need for a literate society, recognizing the sovereignty of the author, encouraging the creation of a canon of literary works and the birth of every one of the market and book professionals, was a slow process that took centuries. However, the birth of hypertext has broken with two of the main concepts which were added to author´s figure: the setting of the text and the membership of the same material. This means that the concept of authorship has changed.

The author has lost now part of his power and also of his mastery because once he publishes a document on the internet it becomes available to all the internet surfers. Some critics claim that electronic text, and hypertext in particular, is killing the author, and there are many reasons for thinking so because nowadays anyone can copy a document as if it was his/hers. Other critics don´t think the hypertext carries bad consequences for the authors. They say that now the reader has much more opportunities and that benefits the author´s figure.

References:

Natural language processing (3rd questionnaire).

June 21, 2009 by Aitor Macia

The Natural Language Processing (NLP or PLN),  is a sub Artificial Intelligence and engineering branch of the computational linguistics. The PLN is responsible for the formulation and research of effective mechanisms computationally for communication between people or between individuals and machines by means of natural languages. It has much to do with the field of computational linguistics.

This system involves both text and speech, but the work done in the speech processing belongs now to another field which is separated from Natural Language Processing. This new technology is being developed by some companies, which pay high sums of money in order to carry out they purpose. They are trying to design a software that can analyze, understand and also generate a language naturally used by humans .

 

References:

Question answering systems (3rd questionnaire).

June 19, 2009 by Aitor Macia

The main purpose of question answering is to answer automatically to those questions which are  made in a natural language.

The search for answers, called in English Question Answering (QA) is a type of information retrieval. Given a certain amount of documents (such as World Wide Web), the system should be able to recover answers to questions raised in natural language.  A system of question answering is one of the most complex systems around the retrieval of information. We must take into account that a system based on the question-answering is much more difficult than a normal system which is responsible for seeking some information in a much more or less large documents, since these should draw from these documents a fragment of text  to respond to a question given in natural language. These systems are closely linked to the seekers web.

There can be two types of questions answered:

  • The first of them are called close-domain questions and can be easily answered because they belong to a topic which can be found on the Internet without any problem.
  • The second ones are called open-domain questions and are harder to answer because they rely on world knowledge.

Even if many questions can be answered due to this system, a hard work must be done in the future in order to be able of answering all of them.

 

References:

Computational Semantics (2nd questionnaire).

June 19, 2009 by Aitor Macia

The computational semantics is a very well done study about the automating of the process of reasoning and constructing withnatural language expressions and also with meaning representations. It is also extremely important in computational linguistics and natural language processing. It has some topics of interest and some of them are: construction of meaning representations, semantic underspecification and anaphora resolution. There are some other traditional topics, but we are not going to talk about them now.

If there is something that must be said about natural language is that it has meaning. There are no doubts about this. And Semantics is the study of that meaning. The Semantics conducts this study in a formal way and in Computational ones the interest is in using the results of that study.

This is a relatively new discipline and as we have said before it combines formal semantics with computational linguistics and also with automated reasoning.  But we require a fully specified syntax for the fragment to guide the process of constructing semantic representations for that fragment of English.

 

References:

The Semantic Web (2nd questionnaire).

May 18, 2009 by Aitor Macia

When we talk about the Semantic Web, we  are referring to an extension of the World Wide Web. Here, the semantics of information and services on the web is defined, and it makes possible for the web to understand and satisfy the requests of people and machines to use the web content.

Tim Berners-Lee, the man who percuted this idea, tried since the beggining to include some semantic information in his creation, but finally it became impossible for him. That was the main reason for his including of the semantic concept.

It has been described in some different ways  such as: ” an utopic vision”, “a web of data” or “a natural paradigm shift”. It has had a great development and has helped creating many different tools like “Wikidsmart”.

 

References: