Readings

This is the schedule for Advanced Seminar in Computational Linguistics: Computational Social Science, Fall 2015.

THIS SCHEDULE IS A WORK IN PROGRESS!
In addition, some topic areas may take longer than expected, so keep an eye on the class mailing list or e-mail me for "official" dates.

Also note that some links point to pay-for-access publishers, but the links are accessible for free from UMD IP addresses.

Sep 2: Class overview
Sep 9: Big picture questions about computational social science: scientific and social tensions
Sep 16: Ideal point models and the Supreme Court
Sep 23: More on ideal point models
Sep 30: Agenda setting and framing
Oct 7: Agendas and influence in debates and other conversations
Oct 14: Ideological bias
Oct 21: Bridge: All Politics is ~~Local~~ Psychological
Oct 28: Personality
Nov 4: Social media analysis: depression
Nov 11: Clinical assessments of patient language
Nov 18: More on clinical assessments of patient language
Nov 25: No class - Happy Thanksgiving!
Dec 2: Interpersonal Relationships
Dec 9: Project presentations/discussion

Sep 2.

Administrivia, introductions
Language and underlying mental state: motivations and overview

Sep 9.

Big picture questions about computational social science: scientific and social tensions

Before delving into particular methods for computational social science, let's start with a few of the big picture questions. There are two fundamental tensions that arise repeatedly when people talk about computational social science. The first is a scientific tension connected with the idea of using analysis of large, naturally occurring datasets to do science, as contrasted with more traditional theoretical and experimental methods. The second is a social tension connected with the idea that machine learning and statistical modeling can be misleading or, worse, can further institutionalize existing societal biases in the automated infrastructure of the future.

I've organized this week's readings into two groups: readings that we can use to structure the discussion and readings that will inform the discussion. There are a lot of readings listed in the latter group, but none of them have any technical complexity (not even the emotional contagion paper) and most of them are quite short, so please at least look them over.

Readings to structure the discussion

Halevy et al. 2009. The unreasonable effectiveness of data
Boyd and Crawford 2012, Critical questions for big data
Hanna Wallach 2014, Big Data, Machine Learning, and the Social Sciences: Fairness, Accountability, and Transparency, talk at NIPS 2014 workshop on “Fairness, Accountability, and Transparency in Machine Learning.
Justin Grimmer and Brandon Stewart 2013, Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts, Political Analysis, 2013. 21 (3), 267--297.

Readings that should also inform the discussion

Lazer et al. 2009. Computational Social Science. Science.
Peter Norvig, On Chomsky and the Two Cultures of Statistical Learning
Shah et al., Big Data, Digital Media, and Computational Social Science: Possibilities and Perils
Guillory and Hancock (2014), “Experimental evidence of massive-scale emotional contagion through social networks”
Jason Baldridge, Emotional Contagion: Contextualizing the Controversy, blog post, July 15, 2014.
Jason Baldridge, Machine Learning And Human Bias: An Uneasy Pair, TechCrunch, August 2, 2015.
Mark Bergen, ‘Machine Learning Is Hard’: Google Photos Has Egregious Facial Recognition Error
Also of interest: Workshop on Fairness, Accountability, and Transparency in Machine Learning

Sep 16.

Ideal point models and the Supreme Court

The Supreme Court is a fascinating domain of study for computational social science. From a social perspective, Supreme Court decisions literally have the power to shape the future of society. From a scientific perspective, there is a long tradition of research in political science asking fundamental questions about the role of the Supreme Court and the nature of its decision making -- as just one example, do amicus ("friend of the court") briefs actually influence justices' decisions?

With regard to computational social science, the Supreme Court is a great area of study. There are large, available sources of data that include voluminous language of various types (e.g. merits briefs, amicus briefs, majority and minority opinions, oral arguments), along with tons of metadata (vote data, age, party of the appointing president, etc.).

And from the perspective of this seminar, the Supreme Court is a fascinating area of study because in this setting the connection between language and mental state is paramount. What can language tell us about the underlying opinion or ideology of justices or the people arguing the case? To what extent do we see linguistic evidence of influence or persuasion? What approach do opposing sides take to framing the same issue? How might power relationships be reflected in language use?

Appetizers

Oliver Roeder, Why the best Supreme Court predictor in the world is some random guy in Queens, FiveThirtyEight, November 17, 2014.
Sarah Kliff, Robots are better than humans at predicting Supreme Court decisions, Vox, June 11, 2015.
Oliver Roeder, Uh, ‘Robots’ Aren’t Better Than Humans At Predicting Supreme Court Decisions, FiveThirtyEight, June 11, 2015.
Jess Bravin's WSJ article about John Roberts's use of "friend"
Gibbs sampling plays an important role in the papers we're going to cover. If you need a refresher, see Gibbs Sampling for the Uninitiated.

Main course

Andrew D. Martin and Kevin M. Quinn. 2002. "Dynamic Ideal Point Estimation via Markov Chain Monte Carlo for the U.S. Supreme Court, 1953-1999." Political Analysis. 10: 134-153.
- Also of interest: Martin, Quinn, and Epstein (2005), The median justice on the U.S. Supreme Court
- Gerrish and Blei (2012), How They Vote: Issue-Adjusted Models of Legislative Behavior (authors' software; see also this project)
Yano, Smith, and Wilkerson, 2012, Textual predictors of bill survival in congressional committees (focuses on a closely related vote prediction task)

Some additional notes on ideal point models

For those who are helped a lot by understanding the intuitive basis for the model, the first section of Sim et al. (2015) has a nice summary of previous ideas (see next week's readings). See also Section 1 of Bafumi et al. http://www.stat.columbia.edu/~gelman/research/published/171.pdf, which has a nice explanation of what this kind of model is doing.
For those who want to see the mathematical discussion in a little more detail, Clinton et al. http://politics.as.nyu.edu/docs/IO/4756/jackman_nemp.pdf (Section 3) fleshes out the discussion in Martin and Quinn Section 3.1 where they formalize the justice's decision process. See also Clinton et al. (2004), http://www.cs.princeton.edu/courses/archive/fall09/cos597A/papers/ClintonJackmanRivers2004.pdf.
Ideal point models in political science are related to item response theory (IRT), which is discussed in the educational assessment literature: probability of a yes/no vote as related to ideological point, in politics, is analogous to the probability of giving a correct answer on a test as related to your ability. There is a nice discussion of IRT at Partchev (2004), https://www.metheval.uni-jena.de/irt/VisualIRT.pdf; see in particular the 2PL model (Section 5). Slides 22-23 at http://jonathantemplin.com/files/irt/irt11icpsr/irt11icpsr_lecture14.pdf derive the form of the model we're looking at from the 2PL IRT model.

Data resources

We have access to pretty much anything one could ask for with regard to the Supreme Court: judge-case-level metadata, case-level metadata, processed opinion content, merits briefs, amicus briefs, transcripts of oral arguments. Here are a few useful links.

Supreme Court Database.
Supreme Court Database (SCDB) metadata documentation
Supreme Court Dialogs Corpus

More generally, there are lots of really interesting sources out there for code and data. One nice compendium I've found is from Bicoastal Datafest: analyzing money's influence on politics, which includes a nice list of well defined project ideas as well as pointers to projects that were done, along with great lists of data and tools.

Sep 23.

More on ideal point models and the Supreme Court

This week we will continue with last week's topic, the Supreme Court.

Sep 30.

Agenda setting and framing

Two central concepts in the study of communications, particularly political communication, are agenda setting and framing. Roughly speaking, agenda setting is about what content gets brought to public attention, traditionally with a focus on political elites and the media as communicators; the classic quote from Cohen (1963) reads that the press the press "may not be successful much of the time in telling people what to think, but it is stunningly successful in telling its readers what to think about. The world will look different to different people depending on the map that is drawn for them by writers, editors, and publishers of the paper they read."

Framing, on the other hand, is not about what gets talked about but how; Entman (1993) writes that framing "framing essentially involves selection and salience. To frame is to select some aspects of a perceived reality and make them more salient in a communicating text, in such a way as to promote a particular problem definition, causal interpretation, moral evaluation, and/or treatment recommendation for the item described."

There is a truly extensive literature on both of these topics -- enough for an entire course in itself. This week we will get familiar with one widely cited discussion, Sheufele's article considering these concepts from the perspective of cognitive effects. Then we'll look at two computational papers related to these concepts. Related to agenda setting, we discuss Leskovec et al., which was a very innovative development in understanding how "memes" get on the radar in news and blogs. Related to framing, we cover Nguyen et al., who develop a model extending the idea of using topic models to define ideal point dimensions (cf. Lauderdale and Clark, last week) to a hierarchical topic model inspired by the treatment of framing as second level agenda setting.

Readings

Dietram A. Scheufele (2000), Agenda-Setting, Priming, and Framing Revisited: Another Look at Cognitive Effects of Political Communication, Mass Communication and Society, 3:2-3, 297-316, DOI: 10.1207/S15327825MCS0323_07.
Leskovec et al., Meme-tracking and the dynamics of the news cycle, Proc. 15th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, 2009. (See also the MemeTracker site, which uses the ideas in this paper to visualize the news cycle.)
Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, and Kristina Miler, Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress, ACL 2015. (I may circulate a pre-publication manuscript extending this paper to journal length.)

Also of interest:

Baumgartner and Jones Agendas and Instability in American Politics.
Dietram A. Scheufele and David Tewksbury, Framing, Agenda Setting, and Priming: The Evolution of Three Media Effects Models, Journal of Communication, Volume 57, Issue 1, pages 9–20, March 2007.
Groseclose, Tim; Milyo, Jeffrey (2005). "A Measure of Media Bias". The Quarterly Journal of Economics (President and Fellows of Harvard College and the Massachusetts Institute of Technology) CXX (4): 1191–1237. (2003 version)
- Liberman, Mark (2005-12-22). "Linguistics, politics, mathematics". Language Log.
- Liberman, Mark (2005-12-23). "Multiplying ideologies considered harmful". Language Log.
Gentzkow and Shapiro, 2007: What drives media slant? Evidence from U.S. daily newspapers
Puglisi, Riccardo and Snyder, James, (2011), The Balanced U.S. Press, No 17263, NBER Working Papers, National Bureau of Economic Research, Inc.
Justin Grimmer, A Bayesian Hierarchical Topic Model for Political Texts: Measuring Expressed Agendas in Senate Press Releases, Political Analysis, 2010
Amber E. Boydstun, Anne Hardy, and Stefaan Walgrave, Two Faces of Media Attention: Media Storm Versus Non-Storm Coverage, Political Communication, 31:509–531, 2014.
Framing
Chong and Druckman (2007), Framing Theory, Annu. Rev. Polit. Sci. 2007. 10:103–26.
Dallas Card, Amber E. Boydstun, Justin H. Gross, Philip Resnik, and Noah A. Smith.. The Media Frames Corpus: Annotations of Frames Across Issues. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2015), Beijing, China, July 2015.
Boydstun, Card, Gross, Resnik, and Smith, Tracking the Development of Media Frames within and across Policy Issues

October 7.

Agendas and influence in debates and other conversations

Prabhakaran, Vinodkumar, Ashima Arora, and Owen Rambow. Power of confidence: How poll scores impact topic dynamics in political debates, ACL 2014 (2014): 49.
- Also of interest: The SITS model in Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, Deborah Cai, Jennifer Midberry, Yuanxin Wang, "I Want to Talk About, Again, My Record On Energy ...": Modeling Topic Control in Conversations using Speaker-centric Nonparametric Topic Models, Machine Learning Journal. (code) For fuller discussion see Viet-An Nguyen's 2015 dissertation.
Danescu-Niculescu-Mizil et al., Echoes of power: Language effects and power differences in social interaction
Tim Hawes, Computational Analysis of the Conversational Dynamics of the United States Supreme Court. (Quickly take a look at Chapter 3, or read the published version in Timothy Hawes, Jimmy Lin, and Philip Resnik, Elements of a computational model for multi-party discourse: The turn-taking behavior of Supreme Court justices. JASIST 60(8), 2009. [corpus] The interesting material is in Chapters 4 and 5.

Also of interest

Amber E. Boydstun, Rebecca A. Glazier, and Claire Phillips, Agenda Control in the 2008 Presidential Debates. American Politics Research, September 2013 vol. 41 no. 5 863-899.
Amber Boydstun, Rebecca Glazier, Matt Pietryka, and Philip Resnik, Real-Time Reactions to a 2012 Presidential Debate: A Method for Understanding Which Messages Matter, Public Opinion Quarterly, January 1, 2014 78: 330-343.
Annotated transcript of the August 6, 2015 Republican primary debate, with commentary from Washington Post reporters and readers.

October 14.

Ideological bias

Readings

Sim, Yanchuan, Brice DL Acree, Justin H. Gross, and Noah A. Smith. Measuring ideological proportions in political speeches. In Proceedings of EMNLP. 2013.
Iyyer, Mohit, Peter Enns, Jordan Boyd-Graber, and Philip Resnik. Political ideology detection using recursive neural networks. In Association for Computational Linguistics. 2014.

Oct 21.

Bridge: All Politics is Local Psychological

Amos Tversky and Daniel Kahneman, The Framing of Decisions and the Psychology of Choice. Science, New Series, Vol. 211, No. 4481. (Jan. 30, 1981), pp. 453-458.
- When you have the chance, I very strongly recommend reading Daniel Kahneman's book, Thinking, Fast and Slow. As a substitution, this book summary is of course inferior to the original but does get the basic ideas across.
Chong and Druckman (2007), Framing Theory, Annu. Rev. Polit. Sci. 2007. 10:103–26.
Boydstun and Ledgerwood. Sticky Prospects: Loss Frames Are Cognitively Stickier Than Gain Frames, Journal of Experimental Psychology: General. [This link might only be available from UMD IP addresses]
Possibly interesting related reading:
- Emir Kamenica and Matthew Gentzkow. Bayesian Persuasion, American Economic Review 101 (October 2011): 2590–2615. http://www.aeaweb.org/articles.php?doi=10.1257/aer.101.6.2590.
- Stefano DellaVigna and Matthew Gentzkow, Persuasion: Empirical Evidence, NBER Working Paper No. 15298, August 2009.

Oct 28.

Personality

One of the better studied applications of language analysis to psychology is the assessment of personality.

Tausczik, Y., & Pennebaker, J.W. (2010). The psychological meaning of words: LIWC and computerized text analysis methods.. Journal of Language and Social Psychology.
F. Mairesse and M. Walker, Words Mark the Nerds: Computational Models of Personality Recognition through Language, Proceedings of the 28th Annual Conference of the Cognitive Science Society, pp. 543-548. 2006.
- If you'd like to read a more in-depth discussion of this line of work, see F. Mairesse, M. Walker, M. Mehl, and R. Moore. 2007. Using linguistic cues for the automatic recognition of personality in conversation and text. Journal of Artificial Intelligence Research (JAIR), 30:457-500.
- For a neat application of this work to natural language generation, see François Mairesse and Marilyn Walker, Controlling User Perceptions of Linguistic Style: Trainable Generation of Personality Traits.. Computational Linguistics, 2011.
Park, G., Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Kosinski, M., Stillwell, D. J., Ungar, L. H., & Seligman, M. E. P. (2014, November 3). Automatic Personality Assessment Through Social Media Language. Journal of Personality and Social Psychology. Advance online publication. http://dx.doi.org/10.1037/pspp0000020

Also of interest

Dietrich, Bryce J., Scott Lasley, Jeffery J. Mondak, Megan L. Remmel, and Joel Turner. Personality and legislative politics: The Big Five trait dimensions among US state legislators. Political Psychology 33, no. 2 (2012): 195-210.
P. Resnik, A. Garron, and R. Resnik, Using Topic Modeling to Improve Prediction of Neuroticism and Depression in College Students, Poster, EMNLP, October 2013.
Pennebaker, J.W., & Chung, C.K., Computerized text analysis of Al-Qaeda transcripts. In K. Krippendorff & M. Bock (Eds.), A Content Analysis Reader.

Nov 4.

Social media analysis: depression

Mental health problems are among the most pressing challenges we face. The numbers in the U.S. alone are staggering: to cite just a few, between 1996 and 2011, annual expenditures on mental disorders rose from $35.2B to $113B, some 25 million American adults will have an episode of major depression this year, suicide is the third leading cause of death for people between 10 and 24 years old, and 89.3 million Americans live in federally-designated Mental Health Professional Shortage Areas.

The papers this week look at the ability to identify depression based on people's behavior in social media. Issues to watch for... How is ground truth being defined? What signals are being included as potential features? How does the experimental setting relate to what one would actually need to accomplish for this to be useful in the real world?

De Choudhury, Munmun, Scott Counts, Eric J. Horvitz, and Aaron Hoff. "Characterizing and predicting postpartum depression from shared facebook data." In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing, pp. 626-638. ACM, 2014.
De Choudhury, Munmun, Michael Gamon, Scott Counts, and Eric Horvitz. "Predicting Depression via Social Media." In ICWSM. 2013.
Resnik, Philip, William Armstrong, Leonardo Claudino, Thang Nguyen, Viet-An Nguyen, and Jordan Boyd-Graber. "Beyond LDA: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter." In Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology (CLPsych). 2015.
- UMD group's shared task paper (updated nuts-and-bolts description plus official shared task results, worth at least skimming): Resnik, Philip, et al. "The University of Maryland CLPsych 2015 shared task system." Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, Denver, Colorado, USA, June. North American Chapter of the Association for Computational Linguistics. 2015.

Also of interest:

Overview of the CLPsych 2015 shared task: Coppersmith et al., CLPsych 2015 Shared Task: Depression and PTSD on Twitter, Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pages 31–39,Denver, Colorado, June 5, 2015.
Using LIWC to predict depression: Stephanie S. Rude, Eva-Maria Gortner, and James W. Pennebaker. 2004. Language use of depressed and depression-vulnerable college students.. Cognition & Emotion, 18(8):1121–1133.
An avatar-based intervention, which could be a setting where NLP could play a role: Pinto, Melissa D., Amy M. Greenblatt, Ronald L. Hickman, Heather M. Rice, Tami L. Thomas, and John M. Clochesy. "Assessing the Critical Parameters of eSMART‐MH: A Promising Avatar‐Based Digital Therapeutic Intervention to Reduce Depressive Symptoms." Perspectives in psychiatric care (2015).

Nov 11.

Clinical assessments of patient language

Last week we focused on social media analysis and what it can tell us about people's mental state, particularly with respect to depression. This week we shift to the analysis of language collected in clinical settings, such as therapist-patient sessions, cognitive assessment tasks, and patient interviews.

Clinician-patient interactions

Imel, Z. E., Steyvers, M., & Atkins, D. C. (2014, May 26). Computational Psychotherapy Research: Scaling up the Evaluation of Patient–Provider Interactions. Psychotherapy. Advance online publication. http://dx.doi.org/10.1037/a0036841
Also of interest:
- Atkins, David C., Mark Steyvers, Zac E. Imel, and Padhraic Smyth. "Scaling up the evaluation of psychotherapy: evaluating motivational interviewing fidelity via statistical text classification." Implementation Science 9, no. 1 (2014): 49.
- Tanana, Michael, Kevin Hallgren, Zac Imel, David Atkins, Padhraic Smyth, and Vivek Srikumar. "Recursive Neural Networks for Coding Therapist and Patient Behavior in Motivational Interviewing.". Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pages 71–79, Denver, Colorado, June 5, 2015.

Psychosis

Bedi, Gillinder et al., Automated analysis of free speech predicts psychosis onset in high-risk youths, NPJ Schizophrenia, 2015.
Editorial Summary: Diagnostics: Automated speech analysis predicts later psychosis. A computer program that analyses natural speech could help predict the onset of psychosis in young people at risk. People with schizophrenia have subtle disorganization in speech, even before they first develop psychosis. In a collaboration between IBM, Columbia University Medical Center, and researchers in South America, an automated program that simulates how the human brain understands language was used to analyze interview transcripts from 34 ‘at risk’ youths. Decrease in the flow of meaning from one spoken phrase to the next, and grammatical markers of speech complexity, identified the five individuals who later developed psychosis. The computer program outperformed clinical assessments in predicting psychosis. While numbers are small in this proof-of-principle study, the authors suggest automated analysis could lay the foundation for a simple clinical test of emerging schizophrenia, which would inform early intervention.

Nov 18.

More on clinical assessments of patient language

Given interests from last week we'll talk more about clinical assessments.

Interpersonal relationships

Ireland, Molly E., Richard B. Slatcher, Paul W. Eastwick, Lauren E. Scissors, Eli J. Finkel, and James W. Pennebaker. 2011. Language style matching predicts relationship initiation and stability. Psychological Science 22 (1): 39-44.
Ranganath, R., Jurafsky, D., & McFarland, D. (2009, August). It's not you, it's me: detecting flirting and its misperception in speed-dates. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1 (pp. 334-342). Association for Computational Linguistics. Chicago.
Girju, Roxana. 2010. Toward social causality: An analysis of interpersonal relationships in online blogs and forums. Proceedings of ICWSM, pp. 66--73.
Philip Bramsen, Martha Escobar-Molano, Ami Patel, and Rafael Alonso. 2011. Extracting social power relationships from natural language. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11), Vol. 1. Association for Computational Linguistics, Stroudsburg, PA, USA, 773-782.

Also of interest; in particular look over the first two if you have time.

McFarland, D. A., Jurafsky, D., & Rawlings, C. (2013). Making the Connection: Social Bonding in Courtship Situations. American Journal of Sociology, 118(6), 1596-1649.
Rajesh Ranganath, Dan Jurafsky, and Daniel A. McFarland. 2013. Detecting friendly, ﬂirtatious, awkward, and assertive speech in speed-dates, Computer Speech and Language. 27:1, 89-115.
Panayiotis G. Georgiou, Matthew P. Black, and Shrikanth S. Narayanan. 2011. Behavioral signal processing for understanding (distressed) dyadic interactions: some recent developments. In Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding (J-HGBU '11). ACM, New York, NY, USA, 7-12. DOI=http://dx.doi.org/10.1145/2072572.2072576
Voigt, Rob, Robert J. Podesva, and Dan Jurafsky. "Speaker movement correlates with prosodic indicators of engagement". In Speech Prosody, vol. 7. 2014.