Presentations

An Improved Method of Automated Nonparametric Content Analysis for Social Science, at New York University, Text as Data Speaker Series, Thursday, December 1, 2016:

A vast literature in computer science and statistics develops methods to automatically classify textual documents into chosen categories. In contrast, social scientists are often more interested in aggregate generalizations about populations of documents, such as the percent of social media posts that speak favorably of a candidate's foreign policy. Unfortunately, trying to maximize the percent of individual documents correctly classified often yields biased estimates of statistical aggregates. Fortunately, classification is neither a necessary nor even a desirable step in estimating ...

How the Chinese Government Fabricates Social Media Posts for Strategic Distraction, not Engaged Argument, at Pacific Information Operations Symposium, Honolulu, Tuesday, November 8, 2016:

This talk is based on this paper, by me, Jennifer Pan, and Margaret Roberts, along with a brief summary of our previous work. Here's an abstract: The Chinese government has long been suspected of hiring as many as 2,000,000 people to surreptitiously insert huge numbers of pseudonymous and other deceptive writings into the stream of real social media posts, as if they were the genuine opinions of ordinary people. Many academics, and most journalists and activists, claim that these so-called "50c party" posts vociferously argue for the government's ...

Big Data is Not About the Data!, at University of Michigan, Friday, October 7, 2016:

The spectacular progress the media describes as "big data" has little to do with the data. Data, after all, is becoming commoditized, less expensive, and an automatic byproduct of other changes in organizations and society. More data alone doesn't generate insights; it often merely makes data analysis harder. The real revolution isn't about the data; it is about the stunning progress in the statistical and other methods of extracting insights from the data. I illustrate these points with a wide range of examples from research I've participated in, including forecasting the solvency of ...

Big Data is Not About the Data!, at Michigan State University, Thursday, October 6, 2016:

The spectacular progress the media describes as "big data" has little to do with the data. Data, after all, is becoming commoditized, less expensive, and an automatic byproduct of other changes in organizations and society. More data alone doesn't generate insights; it often merely makes data analysis harder. The real revolution isn't about the data; it is about the stunning progress in the statistical and other methods of extracting insights from the data. I illustrate these points with a wide range of examples from research I've participated in, including forecasting the solvency ...

How the Chinese Government Fabricates Social Media Posts for Strategic Distraction, not Engaged Argument, at Northeastern University, Monday, September 26, 2016:

This talk is based on this paper, by me, Jennifer Pan, and Margaret Roberts, along with a brief summary of our previous work. Here's an abstract: The Chinese government has long been suspected of hiring as many as 2,000,000 people to surreptitiously insert huge numbers of pseudonymous and other deceptive writings into the stream of real social media posts, as if they were the genuine opinions of ordinary people. Many academics, and most journalists and activists, claim that these so-called "50c party ...

Big Data is Not About the Data!, at Venice, Italy, Friday, September 23, 2016:

The spectacular progress the media describes as "big data" has little to do with the data. Data, after all, is becoming commoditized, less expensive, and an automatic byproduct of other changes in organizations and society. More data alone doesn't generate insights; it often merely makes data analysis harder. The real revolution isn't about the data; it is about the stunning progress in the statistical and other methods of extracting insights from the data. Gary King illustrates these points with a wide range of examples from his research ...

Big Data is Not About the Data! The Power of Modern Analytics, at Civil Service College, Singapore, Friday, August 19, 2016:

The spectacular progress the media describes as "big data" has little to do with the data. Data, after all, is becoming commoditized, less expensive, and an automatic byproduct of other changes in organizations and society. More data alone doesn't generate insights; it often merely makes data analysis harder. The real revolution isn't about the data; it is about the stunning progress in the statistical and other methods of extracting insights from the data. Gary King illustrates these points with a wide range of examples from his research ...

How the Chinese Government Fabricates Social Media Posts for Strategic Distraction, not Engaged Argument, at DARPA, Monday, July 11, 2016:

This talk is based on this paper, by me, Jennifer Pan, and Margaret Roberts, with this abstract: The Chinese government has long been suspected of hiring as many as 2,000,000 people to surreptitiously insert huge numbers of pseudonymous and other deceptive writings into the stream of real social media posts, as if they were the genuine opinions of ordinary people. Many academics, and most journalists and activists, claim that these so-called "50c party" posts vociferously argue for the government's side in political and policy debates. As we show, ...

Introduction to Perusall, at Webinar, Tuesday, April 5, 2016:

Perusall is a new collaborative e-book platform that keeps students on track before class. Perusall ensures students learn more, get instant answers to their questions, come to class prepared (with >90% having done the reading), and enjoy the experience. It enables instructors to teach more effectively, understand student misconceptions, structure class discussion, and save time. Perusall is free. For publishers and authors, it is the ultimate solution to IP piracy, resales, and sell-through. Perusall is based on extensive patent-pending behavioral, data ...

Simplifying Matching Methods for Causal Inference, at University of Pennsylvania, APPC, Friday, April 1, 2016:

In this talk, Gary King introduces methods of matching for causal inference that are simpler, more powerful, and easier to understand than prior approaches. Software is available to implement everything discussed. Copies of some of his papers on the subject are available at his web site GaryKing.org.
