This is the schedule for
Advanced Seminar in Computational Linguistics: Computational Social Science, Fall 2013.
THIS SCHEDULE IS A WORK IN PROGRESS!
In addition, some topic areas may take longer than expected, so keep
an eye on the class mailing list or e-mail me for "official"
Administrivia, class overview.
Computational Analysis of Perspective
Sentiment analysis background
- Bo Pang and Lillian Lee, Opinion mining and sentiment analysis, Foundations and Trends in Information Retrieval 2(1-2), pp. 1-135, July 2008. For our purposes focus on content from the beginning through Section 3.
- Theresa Wilson and Jan Wiebe, Annotating Opinions in the World Press (SIGDIAL 2003). This is the very early Wiebe and Wilson paper describing the original structure of the MPQA annotation back in 2003, including how to compute interannotator agreement in this context.
- Ron Artstein and Massimo Poesio. 2008. Inter-Coder Agreement for Computational Linguistics, Computational Linguistics 34, 4 (December 2008), 555-596. DOI=10.1162/coli.07-034-R2 http://dx.doi.org/10.1162/coli.07-034-R2.
- Matt Thomas, Bo Pang, and Lillian Lee, Get out the vote: Determining support or opposition from Congressional floor-debate transcripts, Proceedings of EMNLP, pp. 327--335, 2006.
- Related reading:
Linguistically informed analysis
- Sayeed, A. B., Boyd-Graber, J., Rusk, B., & Weinberg, A. (2012, June). Grammatical structures for word-level sentiment detection. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 667-676). Association for Computational Linguistics.
- Richard Socher, Alex Perelygin, Jean Wu, Jason Chuang, Christopher Manning, Andrew Ng and Christopher Potts, Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank, Conference on Empirical Methods in Natural Language Processing, EMNLP 2013.
- Marta Recasens, Cristian Danescu-Niculescu-Mizil, and Dan Jurafsky. 2013. Linguistic Models for Analyzing and Detecting Biased Language. 2013. Proceedings of ACL 2013.
- Stephan Greene and Philip Resnik, More Than Words: Syntactic Packaging and Implicit Sentiment, NAACL 2009, Boulder, CO, May 31 - June 5, 2009. (See also: Greene (2007) Spin: Lexical Semantics, Transitivity, and the Identification of Implicit Sentiment, unpublished doctoral dissertation.)
- Possibly worth looking at:
Sep 25. Note: Philip is out of town. Also note that the Linguistics Dept has a projector that can be borrowed, which might be particularly useful for this class.
The mechanics of gathering relevant data: hands-on session
- Using Amazon Mechanical Turk
- Using Crowdflower
- Using the Twitter API
- Creating a Facebook app
Computational political science
- Dietram A. Scheufele1 and David Tewksbury, Framing, Agenda Setting, and Priming: The Evolution of Three Media Effects Models, Journal of Communication, Volume 57, Issue 1, pages 9–20, March 2007.
- Chong, Dennis, and James N. Druckman. "Framing theory". Annu. Rev. Polit. Sci. 10 (2007): 103-126.
- Justin Grimmer and Brandon Stewart, Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts, Political Analysis, 2013. 21 (3), 267--297. Excellent recent survey designed for political scientists.
- Brendan O'Connor, David Bamman, and Noah A. Smith. Computational Text Analysis for Social Science: Model Complexity and Assumptions. In Proceedings of the NIPS Workshop on Comptuational Social Science and the Wisdom of Crowds, Sierra Nevada, Spain, December 2011.
- Related reading:
- Brief overview of agenda setting/framing from my 2009 computational social science seminar by Sergei Golitsinsky.
- Highly recommended book on agenda setting from the political science perspective (as distinct from the communications perspective):
Baumgartner and Jones Agendas and Instability in American Politics.
- D. A. Scheufele, Framing as a theory of media effects, Journal of Communication, Volume 49 Issue 1, Pages 103-122.
- Robert Entman, Framing: Toward Clarification of a Fractured Paradigm, Journal of Communication, Volume 43 Issue 4, Pages 51 - 58, 1993.
- Agenda-setting theory, Wikipedia.
- Cardie and Wilkerson, eds., Special issue on Text Annotation for Political Science Research, Journal of Information Technology and Politics, 5:1, 2008.
- Monroe and Schrodt, Introduction to the Special Issue: The Statistical Analysis of Political Text, Political Analysis (2008) 16:351–355, doi:10.1093/pan/mpn017.
- Groseclose, Tim; Milyo, Jeffrey (2005). "A Measure of Media Bias". The Quarterly Journal of Economics (President and Fellows of Harvard College and the Massachusetts Institute of Technology) CXX (4): 1191–1237. (2003 version)
- Gentzkow and Shapiro, 2007: What drives media slant? Evidence from U.S. daily newspapers
- Leskovec et al., Meme-tracking and the dynamics of the news cycle,
Proc. 15th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, 2009. (See also the
MemeTracker site, which uses the ideas in this paper to visualize the news cycle.)
- Heath, Chip, Chris Bell, and Emily Steinberg. Emotional selection in memes: The case of urban legends. Journal of Personality 81 (6): 1028-1041.
Also note that Brendan O'Connor (CMU) will be giving a Computational Linguistics Colloquium talk at 11am in A.V. Williams 3258 that is likely to be relevant to people in this class.
Agendas in political debates
- Amber E. Boydstun, Rebecca A. Glazier, and Claire Phillips, Agenda Control in the 2008 Presidential Debates. American Politics Research, September 2013 vol. 41 no. 5 863-899.
- Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, Deborah Cai, Jennifer Midberry, Yuanxin Wang, "I Want to Talk About, Again, My Record On Energy ...": Modeling Topic Control in Conversations using Speaker-centric Nonparametric Topic Models, Machine Learning Journal, to appear.
- Amber Boydstun, Rebecca Glazier, Matt Pietryka, and Philip Resnik, "Real-Time Reactions to a 2012 Presidential Debate: How the Message, the Messenger, and the Audience Shape Debate Engagement Effects", under review.
Framing and bias
- Tae Yano, Philip Resnik, and Noah A. Smith. 2010. Shedding (a thousand points of) light on biased language. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk (CSLDAMT '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 152-158.
- Justin Gross et al., Testing the Etch-a-Sketch Hypothesis: Measuring Ideological Signaling via Candidates’ Use of Key Phrases. APSA 2013. (Sister paper to Sim et al. targeting a political science audience.)
- Yanchuan Sim et al., Measuring Ideological Proportions in Political Speeches, EMNLP 2013. (CL-focused sister paper to Gross et al.) (See also supplementary material to the paper.)
- Related reading:
- Monroe, Colaresi, and Quinn, 2009: Fightin' words: Lexical feature selection and evaluation for identifying the content of political conflict
- Graeme Hirst, Yaroslav Riabinin, Jory Graham, Party status as a confound in the automatic classification of political speech by ideology, in Statistical Analysis of Textual Data Proceedings of 10th International Conference Journées d’Analyse statistique des Données Textuelles 9-11 June 2010 - Sapienza University of Rome.
- Eric Hardisty, Jordan Boyd-Graber, and Philip Resnik. Modeling Perspective using Adaptor Grammars. Empirical Methods in Natural Language Processing, Cambridge, MA, October 2010.
Bridge: All Politics is
- Amos Tversky and Daniel Kahneman. The Framing of Decisions and the Psychology of Choice. Science, New Series, Vol. 211, No. 4481. (Jan. 30, 1981), pp. 453-458.
- Boydstun and Ledgerwood. Sticky Prospects: Loss Frames Are Cognitively Stickier Than Gain Frames, Journal of Experimental Psychology: General. [This link might only be available from UMD IP addresses]
- Cristian Danescu-Niculescu-Mizil, Lillian Lee, Bo Pang, and Jon Kleinberg. 2012. Echoes of power: language effects and power differences in social interaction. In Proceedings of the 21st international conference on World Wide Web (WWW '12). ACM, New York, NY, USA, 699-708. http://doi.acm.org/10.1145/2187836.2187931
- Possibly interesting related reading:
- Emir Kamenica and Matthew Gentzkow. Bayesian Persuasion, American Economic Review 101 (October 2011): 2590–2615. http://www.aeaweb.org/articles.php?doi=10.1257/aer.101.6.2590.
- Stefano DellaVigna and Matthew Gentzkow, Persuasion: Empirical Evidence, NBER Working Paper No. 15298,
Text-driven computational models in psychology
Language use, personality, and behavior
- Tausczik, Y., & Pennebaker, J.W. (2010). The psychological meaning of words: LIWC and computerized text analysis methods.. Journal of Language and Social Psychology.
- F. Mairesse and M. Walker, Words Mark the Nerds: Computational Models of Personality Recognition through Language, Proceedings of the 28th Annual Conference of the Cognitive Science Society, pp. 543-548. 2006.
- P. Resnik, A. Garron, and R. Resnik, Using Topic Modeling to Improve Prediction of Neuroticism and Depression in College Students, Poster, EMNLP, October 2013.
- Atkins, David C., Timothy N. Rubin, Mark Steyvers, Michelle A. Doeden, Brian R. Baucom, and Andrew Christensen. "Topic models: A novel method for modeling couple and family text data". Journal of Family Psychology 26, no. 5 (2012): 816.
- Related reading:
- Pennebaker, J. W., Chung, C. K., Ireland, M., Gonzales, A. L., & Booth, R. J. (2007). The development and psychometric properties of LIWC2007. Austin, TX: LIWC.net.
- Pennebaker, J. W., & Chung, C. K. (2008). Computerized text analysis of al-Qaeda statements. In K. Krippendorff and M. Bock (Eds.), A content analysis reader (pp. 453-466). Thousand Oaks, CA: Sage.
- Rude, S., Gortner, E.M., and Pennebaker, J.. 2004. Language use of depressed and depression-vulnerable college students. Cognition & Emotion 18:8, 1121-1133; S Stirman and J W Pennebaker. 2001. Word Use in the Poetry of Suicidal and Nonsuicidal Poets. Psychosomatic Medicine 63:517-522).
- Argamon, S., Koppel, M., Pennebaker, J. W., and Schler, J. 2009. Automatically profiling the author of an anonymous text., Commun. ACM 52, 2 (Feb. 2009), 119-123. DOI= http://doi.acm.org/10.1145/1461928.1461959. [Alternative PDF version]
- Emily T. Prud’hommeaux, Brian Roark, Lois M. Black, and Jan van Santen, Classification of atypical language in autism, Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics, 2011.
- De Choudhury, M., Counts, S., & Horvitz, E. (2013). Predicting Postpartum Changes in Emotion and Behavior via Social Media, CHI’13, Paris, France,
- Pakhomov S, Chacon D, Wicklund M, Gundel J., Computerized assessment of syntactic complexity in Alzheimer's disease: a case study of Iris Murdoch's writing, Behav Res Methods. 2011 Mar;43(1):136-44.
- Maider Lehr, Izhak Shafran, Emily Prud'hommeaux and Brian Roark. 2013. Discriminative joint modeling of lexical variation and acoustic confusion for automated narrative retelling assessment. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT).
- Related reading:
- Related reading:
No class. Happy Thanksgiving!
Dec 4 .
- McFarland, Daniel A., Dan Jurafsky, and Craig M. Rawlings. 2013. "Making the Connection: Social Bonding in Courtship Situations". American Journal of Sociology Vol. 118, No. 6, 1596-1649.
- Rajesh Ranganath, Dan Jurafsky, and Daniel A. McFarland. 2013. Detecting friendly, ﬂirtatious, awkward, and assertive speech in speed-dates, Computer Speech and Language. 27:1, 89-115.
- Ireland, Molly E., Richard B. Slatcher, Paul W. Eastwick, Lauren E. Scissors, Eli J. Finkel, and James W. Pennebaker. 2011. Language style matching predicts relationship initiation and stability. Psychological Science 22 (1): 39-44.
- Philip Bramsen, Martha Escobar-Molano, Ami Patel, and Rafael Alonso. 2011. Extracting social power relationships from natural language. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11), Vol. 1. Association for Computational Linguistics, Stroudsburg, PA, USA, 773-782.
- Panayiotis G. Georgiou, Matthew P. Black, and Shrikanth S. Narayanan. 2011. Behavioral signal processing for understanding (distressed) dyadic interactions: some recent developments. In Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding (J-HGBU '11). ACM, New York, NY, USA, 7-12. DOI=10.1145/2072572.2072576 http://doi.acm.org/10.1145/2072572.2072576
- Danescu-Niculescu-Mizil, Cristian, Michael Gamon, and Susan Dumais. Mark my words! Linguistic style accommodation in social media. Proceedings of WWW (2011).
- Girju, Roxana. 2010. Toward social causality: An analysis of interpersonal relationships in online blogs and forums. Proceedings of ICWSM, pp. 66--73.
Last class! Discussion of people's projects.
Miscellaneous list of other papers that are potentially of interest
- Jurka, Timothy P., Loren Collingwood, Amber E. Boydstun, Emiliano Grossman, and Wouter van Atteveldt. 2013. RTextTools: A Supervised Learning Package for Text Classification. The R Journal, 5(1): 6-12.
- Tim Hawes, Computational Analysis of the Conversational Dynamics of the United States Supreme Court, UMD Master's thesis. Chapters 1, 2, 4, 6 (Chs 1,6 are very short intro/conclusion), Chapter 5 and Chapter 3 optional (in that order of priority).
- Vasileios Hatzivassiloglou; Kathleen R. McKeown, Predicting the Semantic Orientation of Adjectives (ACL 1997). Important early paper on acquiring what we would now call a sentiment lexicon.
- Turney, P.D. (2002), Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews, Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL'02), Philadelphia, Pennsylvania, 417-424. (Optionally, for a more extensive discussion and related work, see also Turney, P.D., and Littman, M.L. (2003), Measuring praise and criticism: Inference of semantic orientation from association, ACM Transactions on Information Systems (TOIS), 21 (4), 315-346.) Important early work on sentiment classification of reviews.
- Wei-Hao Lin, Theresa Wilson, Janyce Wiebe, and Alexander Hauptmann Which Side are You on? Identifying Perspectives at the Document and Sentence Levels, The Tenth Conference on Natural Language Learning (CoNLL), 2006.
- Optional: Wei-Hao Lin, Identifying Ideological Perspectives in Text and Video, Ph.D. thesis, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, CMU-LTI-08-008, 2008. [Slides] (More generally, see Lin's publications page for a variety of interesting work on identifying ideological perspectives.)
- Predicting response to political blog posts with topic models. In Proceedings of Human Language Technologies: the 2009 Annual Conference of the North American Chapter of the Association For Computational Linguistics (Boulder, Colorado, May 31 - June 05, 2009). Human Language Technology Conference. Association for Computational Linguistics, Morristown, NJ, 477-485.
- Possibly: something on coordination of linguistic style? (http://www.mpi-sws.org/~cristian/) -- and see http://www.cs.cornell.edu/courses/cs6742/2011sp/ Natural Language Processing and Social Interaction
- D. Hopkins and G. King, A method of automated nonparametric content analysis for
social science. American Journal of Political Science, Vol. 54, No. 1, January 2010, Pp. 229–247.
- Brendan O'Connor, Brandon Stewart, and Noah A. Smith. Learning to Extract International Relations from Political Context. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, August 2013.
Philip Resnik, Associate Professor
Department of Linguistics and Institute for Advanced Computer Studies
Department of Linguistics
1401 Marie Mount Hall UMIACS phone: (301) 405-6760
University of Maryland Linguistics phone: (301) 405-8903
College Park, MD 20742 USA Fax: (301) 314-2644 / (301) 405-7104
http://umiacs.umd.edu/~resnik E-mail: resnik AT umd _DOT.GOES.HERE_ edu