Cong, Lin William; Liang, Tengyuan; Zhang, Xiao; Zhu, Wu - National Bureau of Economic Research - 2024
We introduce a general approach for analyzing large-scale text-based data, combining the strengths of neural network language processing and generative statistical modeling to create a factor structure of unstructured data for downstream regressions typically used in social sciences. We generate...