Natural Language Processing for Social Media. Diana Inkpen

Чтение книги онлайн.

Читать онлайн книгу Natural Language Processing for Social Media - Diana Inkpen страница 4

Natural Language Processing for Social Media - Diana  Inkpen Synthesis Lectures on Human Language Technologies

Скачать книгу

       4.8 NLP-based User Modeling

       4.9 Applications for Entertainment

       4.10 NLP-based Information Visualization for Social Media

       4.11 Government Communication

       4.12 Summary

       5 Data Collection, Annotation, and Evaluation

       5.1 Introduction

       5.2 Discussion on Data Collection and Annotation

       5.3 Spam and Noise Detection

       5.4 Privacy and Democracy in Social Media

       5.5 Evaluation Benchmarks

       5.6 Summary

       6 Conclusion and Perspectives

       6.1 Conclusion

       6.2 Perspectives

       A TRANSLI: a Case Study for Social Media Analytics and Monitoring

       A.1 TRANSLI architecture

       A.2 User Interface

       Glossary

       Bibliography

       Authors’ Biographies

       Index

       Preface to the Second Edition

      This book presents the state-of-the-art in research and empirical studies in the field of Natural Language Processing (NLP) for the semantic analysis of social media data. Because the field is continuously growing, this second edition adds information about recently proposed methods and their results for the tasks and applications that we covered in the first edition.

      Over the past few years, online social networking sites have revolutionized the way we communicate with individuals, groups, and communities, and altered everyday practices. The unprecedented volume and variety of user-generated content and the user interaction network constitute new opportunities for understanding social behavior and building socially intelligent systems.

      Much of the research on social networks and the mining of the social web is based on graph theory. That is apt because a social structure is made up of a set of social actors and a set of the dyadic ties between these actors. We believe that the graph mining methods for structure and information diffusion or influence spread in social networks need to be combined with the content analysis of social media. This provides the opportunity for new applications that use the information publicly available as a result of social interactions. Adapted classic NLP methods can partially solve the problem of social media content analysis focusing on the posted messages. When we receive a text of less than 10 characters, including an emoticon and a heart, we understand it and even respond to it! It is impossible to use NLP methods to process this type of document, but there is a logical message in social media data based on which two people can communicate. The same logic dominates worldwide, and people from all over the world use it to share and communicate with each other. There is a new and challenging language for NLP.

      We believe that we need new theories and algorithms for semantic analysis of social media data, as well as a new way of approaching the big data processing. By semantic analysis, in this book, we mean the linguistic processing of the social media messages enhanced with semantics, and possibly also combining this with the structure of the social networks. We actually use the term in a more general sense to refer to applications that do intelligent processing of social media texts and meta-data. Some applications could access very large amounts of data; therefore, the algorithms need to be adapted to be able process data (big data) in an online fashion and without necessarily storing all the data.

      This motivated us to give two tutorials: Applications of Social Media Text Analysis at EMNLP 20151 and Natural Language Processing for Social Media at the 29th Canadian Conference on Artificial Intelligence (AI 2016).2 We also organized several workshops on this topic, Semantic Analysis in Social Networks (SASM 2012)3 and Language Analysis in Social Media (LASM 20134 and LASM 20145), in conjunction with conferences organized by the Association for Computational Linguistics6 (ACL, EACL, and NAACL-HLT).

      Our goal was to reflect a wide range of research and results in the analysis of language with implications for fields such as NLP, computational linguistics, sociolinguistics, and psycholinguistics. Our workshops invited original research on all topics related to the analysis of language in social media, including the following topics.

      • What do people talk about on social media?

      • How do they express themselves?

      • Why do they post on social media?

      • How do language and social network properties interact?

      • Natural language processing techniques for social media analysis.

      • Semantic Web/ontologies/domain models to aid in understanding social data.

      • Characterizing participants via linguistic analysis.

      • Language, social media, and human behavior.

      There were several other workshops on similar topics, for example, the Making Sense of Microposts (#Microposts)7 workshop series in conjunction with the World Wide Web Conference 2012–2016. These workshops focused in particular on short informal texts that are published without much effort (such as tweets, Facebook shares, Instagram-like shares, and Google+ messages). There has been another series of workshops on Natural Language Processing for Social Media (SocialNLP) since 2013, with SocialNLP 2017 offered in conjunction with EACL 20178 and IEEE BigData 2017.9

      The intended audience of this book is researchers who are interested in developing tools and applications for automatic analysis of social media texts. We assume that the readers have basic knowledge in the area of natural language processing and machine learning. We hope that this book will help the readers better understand computational linguistics and social media analysis, in particular text mining techniques and NLP applications (such as summarization, localization detection, sentiment and emotion analysis, topic detection, and machine translation) designed specifically for social media texts.

      Atefeh Farzindar and Diana Inkpen

      December 2017

      

Скачать книгу