Justin Grimmer et al., "Text as Data: A New Framework for Machine Learning and the Social Sciences" (Princeton UP, 2022)

56:38
 
分享
 

Manage episode 340190922 series 2917053
由Player FM以及我们的用户群所搜索的Marshall Poe and New Books Network — 版权由出版商所拥有,而不是Player FM,音频直接从出版商的伺服器串流. 点击订阅按钮以查看Player FM更新,或粘贴收取点链接到其他播客应用程序里。

From social media posts and text messages to digital government documents and archives, researchers are bombarded with a deluge of text reflecting the social world. This textual data gives unprecedented insights into fundamental questions in the social sciences, humanities, and industry. Meanwhile new machine learning tools are rapidly transforming the way science and business are conducted. Text as Data shows how to combine new sources of data, machine learning tools, and social science research design to develop and evaluate new insights.

Text as Data: A New Framework for Machine Learning and the Social Sciences (Princeton UP, 2022) is organized around the core tasks in research projects using text--representation, discovery, measurement, prediction, and causal inference. The authors offer a sequential, iterative, and inductive approach to research design. Each research task is presented complete with real-world applications, example methods, and a distinct style of task-focused research.

Bridging many divides--computer science and social science, the qualitative and the quantitative, and industry and academia--Text as Data is an ideal resource for anyone wanting to analyze large collections of text in an era when data is abundant and computation is cheap, but the enduring challenges of social science remain.

  • Overview of how to use text as data
  • Research design for a world of data deluge
  • Examples from across the social sciences and industry

Peter Lorentzen is economics professor at the University of San Francisco. He heads USF's Applied Economics Master's program, which focuses on the digital economy. His research is mainly on China's political economy.

448集单集