Beautiful visualizations of how language differs among document types.
A Python package for contextualized topic modeling. Contextualized topic models (CTMs) combine contextualized embeddings (e.g., BERT) with topic models to produce coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
Text analysis with networks.
Interpretable data visualizations for understanding how texts differ at the word level
A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)
An automated web crawler for extracting central bankers' speeches
Code for collecting and cleaning speeches (text) of the US 2020 election campaign. Corresponding publication: "A text dataset of campaign speeches of the main tickets in the 2020 US presidential election", by Ioannis Chalkiadakis, Louise Anglès d’Auriac, Gareth W. Peters, and Divina Frau-Meigs