DigiNews

Tech Watch by Johan Denoyer

← Back to articles

ICLR 2026 - Institutional Affiliations Dataset & Analysis

Quality: 8/10 Relevance: 9/10

Summary

An end-to-end pipeline converts 5,356 ICLR 2026 papers into a clean PDF-derived institutional-affiliations dataset with a treemap visualization. It avoids author profile drift by parsing the PDF title blocks for affiliations, canonicalizes institution names, and offers multiple counting methods; the project includes steps to reproduce the pipeline and generate public datasets and charts.

🚀 Service construit par Johan Denoyer