Sunday 8 June 2014

Natural Language Processing on the Web




GUY LAPALME 
University of Montreal
IRO, RALI

VIDEO

Overview: Even with the advent of the semantic web, most of the content available on the web is still in natural language, more than half of it in English but an increasing share in other languages as well. We will present some of the links (pun intended) between natural language processing (NLP) and the web: how NLP helps in processing information on the web, but also how web technologies help in the development of NLP technologies.



    READINGS
    Lapalme, G. (2013) XML: Looking at the Forest Instead of the Trees.
    Lapalme, G., P. Langlais, and F. Gotti (2012) The Bilingual Concordancer TransSearch, NAACL 2012.
    Gotti, F., P. Langlais, and G. Lapalme (2014) Designing a Machine Translation System for Canadian Weather Warnings: a Case Study, Natural Language Engineering 20(3): 399-433.

32 comments:

  1. A recent fad in sentiment analysis is to create software that can detect sarcasm and false positives (spawned by the bid put out by the US Secret Service). Do you think it is possible to create NLP software that does this? It seems to me that a person detects sarcasm because she knows the context. If a celebrity wears a really ugly dress and someone tweets, "That celebrity's dress is the prettiest thing I have ever seen," only people who have seen the dress and can evaluate its prettiness can detect the tweet's sarcasm. Do you have any ideas for how a machine could overcome this?

    1. I do not think this is a problem in the NLP domain. If you said the same thing to a human, he or she would not understand it without seeing the dress. So you would have to show the dress to the machine and make it able to make an aesthetic judgement, which is something socially constructed, imho. So I would guess that what you are referring to is some sort of AI engine rather than NLP software...

    2. I categorize it in the NLP domain because it is a facet of analyzing human language to facilitate communication between man and machine. Perhaps sarcasm is a part of human language that requires contextual knowledge, so AI is necessary for the evolution of NLP. But I would not separate the two fields.

    3. Perhaps the sarcasm detection software could analyze related tweets and deduce the intent behind the ugly dress comment. We can learn a lot about online content by looking at the content it is linked to.

    4. I do not think that NLP will ever be able to detect the sarcasm in one specific tweet or message. The power of the web comes from the sheer number of different messages, which makes it possible to detect tendencies and main ideas. This is the law of large numbers at work: I do not think that any system, or in fact any human, can appreciate sarcasm from a single message; you need to take the context into account. I would be very surprised if the US Secret Service followed one specific message; they are interested in general trends, not specifics.

  2. Dear Guy, thank you very much for this presentation! I have a series of questions. It is fascinating that robots are more present than humans on the web! So will the global brain be first by or for robots rather than for humans? More precise statistics would be needed for this interpretation. In this thread, what is the main role of NLP, and what is the main challenge ahead in building thinking machines? From the point of view of philosophy of mind, can NLP model intentionality?

    1. Although there are a lot of robots on the Web, they do what humans programmed them to do. Would a community who shares all their knowledge through a library be more book than human? Is the collaboration of humans and libraries more for the books or for the humans?

      I think the Global Brain is humans and the tools they use; made for humans and their desires. (That is of course if a Global Brain even exists).

    2. I was just giving the information about the omnipresence of robots to motivate the need for NLP in organizing data on the web, so that it can be read by machines while being written by humans, in the context of the semantic web. You should not push this claim too far into philosophy or into the architecture of the human mind.

  3. Natural language is deeply anchored in the world around us. With all the advancements in NLP, isn't it correct to say that computers will never fully understand natural language without knowledge of the world? This knowledge, however, is not fully encoded in language. Therefore, isn't it correct to say that computers must acquire broad cognitive competences in order to interact in the physical dimension and, as a consequence, to develop language capabilities?

    1. In fact, humans also "will never fully understand natural language", so you should not ask too much from the machines.

  4. You said that identical subjects, properties and objects must have the same URI. But sometimes one thing has two different names, as is the case for Venus and the evening star. What can be done in this kind of case?

    1. There is a provision in all Semantic Web formalisms to assert that two URIs are equivalent, so that they are treated specially during inferencing. But the fundamental problem of correctly identifying an entity and linking it with (one of) the correct URI(s) is still a complex one. You surely do not want to give a different URI to each occurrence of the same entity.
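      To make the equivalence mechanism concrete, here is a minimal sketch in Python (the `ex:` URIs and triples are invented examples, not actual Semantic Web machinery): an owl:sameAs-style assertion can be handled by canonicalizing URIs with a union-find structure before inference, so that facts stated about either name attach to one entity.

```python
# Minimal sketch: treating equivalent URIs (owl:sameAs style) as one
# entity for inference. URIs and triples below are invented examples.

def find(parent, x):
    # Union-find lookup with path compression: returns the canonical URI.
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent.setdefault(parent[x], parent[x])
        x = parent[x]
    return x

def same_as(parent, a, b):
    # Assert that URIs a and b denote the same entity.
    parent[find(parent, a)] = find(parent, b)

triples = [
    ("ex:Venus", "ex:orbits", "ex:Sun"),
    ("ex:EveningStar", "ex:visibleAt", "ex:dusk"),
]
parent = {}
same_as(parent, "ex:Venus", "ex:EveningStar")

# Rewrite every triple with canonical URIs; facts stated about either
# name now attach to the same node.
canonical = [(find(parent, s), p, find(parent, o)) for (s, p, o) in triples]
facts = [(p, o) for (s, p, o) in canonical if s == find(parent, "ex:Venus")]
```

      After canonicalization, a query about "ex:Venus" sees both the orbit fact and the visibility fact, even though they were asserted under different names.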

  5. This was a very helpful presentation. Professor Lapalme mentioned that NLP is everywhere on the Web already. I have a naïve question. What would the Web look like without NLP? What are the main functionalities that we take for granted, which would no longer be available without NLP?

    1. Without NLP, you would not have any search engine, nor Google Translate... so you would have to use keywords combined with logical operators to search for information on the web.
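      For contrast, here is a toy version of that pre-NLP, keywords-plus-logical-operators style of search (documents and queries are invented for illustration): an inverted index queried with boolean AND/OR, with no linguistic processing at all.

```python
# Toy boolean keyword retrieval, the pre-NLP search style the reply
# alludes to. Documents are invented examples.

docs = {
    1: "natural language processing on the web",
    2: "the semantic web and linked data",
    3: "statistical machine translation systems",
}

# Build an inverted index: word -> set of document ids containing it.
index = {}
for doc_id, text in docs.items():
    for word in text.split():
        index.setdefault(word, set()).add(doc_id)

def search_and(*words):
    # Boolean AND: documents containing every keyword.
    sets = [index.get(w, set()) for w in words]
    return set.intersection(*sets) if sets else set()

def search_or(*words):
    # Boolean OR: documents containing at least one keyword.
    return set().union(*(index.get(w, set()) for w in words))
```

      Note that such a system matches only exact word forms: a query for "translate" would miss the document containing "translation", which is exactly the kind of gap NLP fills.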

  6. I would like to know how "Big Data" relates to NLP. Are they part of the same application domain, or are they two distinct domains whose experts help each other?

    Also, I hear many professionals say that they work in the field of IR and others in NLP. Are these two distinct application domains? Or is NLP simply a generalization of IR?

    1. Strictly speaking, Big Data is independent of NLP, since one can imagine situations with large quantities of numerical data to process: e.g. results of scientific experiments, meteorological data, information coming from medical measurement devices, etc.

      That said, the quantity of text available on the web makes natural language a Big Data topic in its own right. We are in fact taking part in a project on "Big Text Data" funded by NSERC (CRSNG).

  7. What is the difference between the field of IE and the field of IR?

    1. IR (information retrieval) tries to find documents that deal with a given topic; it is up to the reader to find the answer in them.

      IE (information extraction) tries to give the answer directly, possibly by pulling information from one or more documents.
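      The difference can be sketched in a few lines (a toy example with invented documents; the four-digit-year pattern stands in for real extraction rules): IR hands back whole documents, while IE pulls the answer itself out of them.

```python
import re

# Contrast IR and IE on a toy corpus (documents are invented examples).
docs = [
    "The conference was held in Montreal in 2012.",
    "NLP systems process text written by humans.",
]

def retrieve(query_word):
    # IR: return the documents mentioning the query word;
    # the reader must find the answer inside them.
    return [d for d in docs if query_word.lower() in d.lower()]

def extract_year(query_word):
    # IE: pull the answer itself out of the matching documents,
    # here with a crude pattern for a four-digit year.
    for d in retrieve(query_word):
        m = re.search(r"\b(19|20)\d{2}\b", d)
        if m:
            return m.group(0)
    return None
```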

  8. Are there any evolutionary methods in NLP, e.g. the use of genetic algorithms to breed better language processors? How is this field affected by new AI methods such as deep learning?

    Thanks for a very clear talk!

    1. I am not aware of any specific work (or success) in the area of genetic algorithms for NLP. For the moment, genetic algorithms and deep learning are used more in the area of "perception" (basic letter or speech recognition) than in higher-level understanding.

  9. Currently, is there an implementation of a complete formal grammar of a natural language such as French or English? Is this an area that interests many researchers, or are we at a dead end? What do you think?

    1. A natural language is not a formal language, because it evolves as people use it. There are nevertheless reasonably good parsers for English and French (Talismane for French, the Stanford Parser for English) that can provide an analysis for close to 80% of everyday sentences. Since natural language is often ambiguous, their success rate suffers, but it is often sufficient for interesting applications: detecting spelling and syntax errors, for example.

  10. Very interesting talk and work, thank you. My question is about WEB-NLP-WEB: you said Google should be careful about using its own translations to improve its system, but we can see that Google allows people to contribute to these translations (there are a lot of errors, even from people). Is this a good way to improve NLP, or should we leave this task to linguists only?

  11. You speak of the web as something important for the development of NLP. Some people go further and see it as a scientific revolution. E.g. Steadman (2013):

    « The big data approach to intelligence gathering allows an analyst to get the full resolution on worldwide affairs. Nothing is lost from looking too closely at one particular section of data; nothing is lost from trying to get too wide a perspective on a situation that the fine detail is lost. The algorithms find the patterns and the hypothesis follows from the data. The analyst doesn't even have to bother proposing a hypothesis any more. »

    It seems to me that Zodiac would be an example of this kind of thing: as you say, it is NLP without NLP, or one could say NLP without a theory of NLP. Does that match your experience?

    1. Zodiac is an example of an "automatic" process that we carry out without much thought. A French speaker usually manages to place accents correctly without thinking about it; that is what Zodiac manages to do for the moment.

      Moreover, Zodiac cannot decide when there is an ambiguity. For example, in "je vais ou je viens", it is hard to judge whether it should be "où" or "ou".

      But several higher-level NLP processes remain that cannot be handled so simply.
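      A naive sketch of the frequency-based idea (this is not Zodiac's actual algorithm; the tiny lexicon and counts are invented): map each unaccented form to its most frequent accented spelling, which is precisely where ambiguous pairs like "ou"/"où" defeat the approach.

```python
import unicodedata

def strip_accents(word):
    # Remove diacritics: "déjà" -> "deja".
    return "".join(c for c in unicodedata.normalize("NFD", word)
                   if unicodedata.category(c) != "Mn")

# Tiny lexicon of accented forms with invented frequencies; a real
# system would estimate these from a large corpus.
lexicon = {"déjà": 50, "élève": 30, "élevé": 20}

# For each unaccented key, keep the most frequent accented candidate.
best = {}
for form, freq in lexicon.items():
    key = strip_accents(form)
    if key not in best or freq > best[key][1]:
        best[key] = (form, freq)

def restore(word):
    # Return the most frequent accented form; unknown words unchanged.
    form, _ = best.get(strip_accents(word), (word, 0))
    return form
```

      Here "eleve" is always restored to the more frequent "élève", silently losing "élevé": exactly the kind of ambiguity that a simple frequency lookup cannot resolve.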

  12. Given that this field is progressing, would you speculate on a future date range when speech recognition will be on par with or exceed the NLP of written words? As Rachel mentioned, there must be contextual information in the tone of speech and in visual representation in the S-O-P that would allow deeper analysis.

    On Weita's comment: I think there is a huge proportion of content-driven articles and blogs which not only have the immediate context of the particular web page but are also embedded in a larger social discourse. For these reasons, I think it is logical to see NLP as the foundation on which to build this research. If I am wrong, then a response should still address the need to classify this contextual information.

    1. Context, as you point out, is a difficult problem even for humans! Given that the best NLP systems have trouble with sentences that even two-year-old children can deal with correctly, we still have plenty of progress to make.

      Of course, for things that rely on memory or on rote learning, machines are much better than humans (as for computing with numbers), but there is still plenty of work to be done by people like you.

  13. Nice talk! I really liked the highlight that the web is still full of human language. I thought this was a good complement to all the talks we heard on the semantic web and data mining. I also liked the fact that NLP tries to build a link between our language and that of the semantic web.
    I have two (probably very naive) questions:
    1. What problems can NLP address that escape other strategies of data mining on the semantic web?
    2. What about a tool that puts both strategies together (semantic web data mining + natural language processing)? Does it exist? What does it do, and what results has it produced so far?

    1. Most data mining works with numerical values (at best with ordinal scales). NLP tries to grasp the "meaning" of the information on the web, but then the question remains: how can you combine this information with other sources to create useful summaries that can be combined with other big data?

      As for the second question, I do not have a good successful example at hand, even though you will hear some salesmen brag that they can do it. I have yet to see a convincing one. It will be your job to build it, as you are just starting in this area.

  14. You mentioned that NLP helps the web, and also that the web helps NLP. Another way in which the web helps NLP is Google's "did you mean ________" feature. With this feature, they are able to create a catalog of common spelling mistakes associated with each English word.

    1. This is a clear case of the law of large numbers. Most people type the correct spelling on the web, so when a given misspelling occurs it is detected because it does not correspond to a real word, and the system suggests the most common spelling that is "close enough" to it. "Close enough" is computed by determining the number of editing steps that would be needed to go from the bad spelling to a good one.

      This is indeed yet another good example of the web helping NLP: it is mostly mechanical, but it works and is quite useful.
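      The "number of editing steps" is the classic Levenshtein edit distance; here is a standard dynamic-programming sketch, together with a hypothetical "did you mean" helper over a tiny invented vocabulary (a real system would rank by frequency as well as distance).

```python
def edit_distance(a, b):
    # Classic Levenshtein dynamic programming: minimum number of
    # insertions, deletions and substitutions turning a into b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                  # deletion
                            curr[j - 1] + 1,              # insertion
                            prev[j - 1] + (ca != cb)))    # substitution
        prev = curr
    return prev[-1]

# A "did you mean" suggestion: the closest word from a tiny,
# invented vocabulary.
vocabulary = ["language", "processing", "semantic", "translation"]

def suggest(misspelled):
    return min(vocabulary, key=lambda w: edit_distance(misspelled, w))
```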

  15. Would it be possible to use NLP to convert natural language into a computer programming language? Such that I could describe a web page and the computer would write the HTML/CSS for me. Or perhaps you could describe to the computer what you wanted a program to do, and it would write the code for you.
