Analyzing Reference Networks Across Philosophical Texts

Over the past year, I worked with Aron Culotta, a well-known professor in AI and natural language processing, to conduct what is currently the largest computational analysis of philosophical texts within the English language. Our work is currently undergoing peer review from the Oxford Journal, "Digital Scholarship in the Humanities."

In our study, we downloaded & processed over 2,000 philosophical texts, and used Python to track how often each author references other authors within the dataset. With these results, we constructed graph representations—with authors as nodes and references as edges—to represent the reference networks mathematically. We also applied transformer models to classify each reference by its surrounding contexts, allowing us to identify specific threads in how philosophical influence has spread.

We applied multiple algorithms from graph theory to analyze these networks—clustering them, identifying paths between nodes, and determining the influence of individual authors. We combined our data analysis with a deep dive into philosophical literature, spanning across both primary and secondary sources to inform our analysis. Finally, we created a visualization tool with the D3 javascript library to allows anyone to interact with and visualize our networks for their own research.

Our paper first analyzes the overall structure of our network. For instance, we use total incoming references and in-degree centrality (incoming references from unique authors) to quantify the level of influence of various philosophers:

Network Structure Analysis

Figure 1: Distribution of incoming references across authors in our dataset.

Network Structure Analysis

Figure 2: In-degree centrality of top 15 authors in our dataset.

We also make similar calculations of subsets of our data which only include references classified into various topics. This allows us to make more precise estimates on the level of influence across different areas within philosophy:

Network Structure Analysis

Figure 3: Percentage of incoming references across topics in philosophy. Shows how the most influential philosophers have impacted different areas.

Finally, we explore the positions of various key figures with our visualization tool, providing additional context on how individuals relate to the wider network:

Network Structure Analysis

Figure 4: Reference networks of top authors with our visualization tool.

Network Structure Analysis

Figure 5: Kant's position in our reference network

If interested, feel free to check out the full paper here, our GitHub repository here, our visualization tool here, or some of my previous work in this area here. Special thanks to Aron Culotta for his guidance and the Tulane computer science department for their support!