Lang Resources & Evaluation (2013) 47:919–944 DOI 10.1007/s10579-012-9211-2
ORIGINAL PAPER
Dragomir R. Radev · Pradeep Muthukrishnan · Vahed Qazvinian · Amjad Abu-Jbara
Published online: 6 January 2013
© Springer Science+Business Media Dordrecht 2013
Abstract We introduce the ACL Anthology Network (AAN), a comprehensive, manually curated networked database of citations, collaborations, and summaries in the field of Computational Linguistics. We also present a number of statistics about the network, including the most cited authors, the most central collaborators, and network statistics about the paper citation, author citation, and author collaboration networks.
Keywords ACL Anthology Network · Bibliometrics · Scientometrics · Citation analysis · Citation summaries
1 Introduction
The ACL Anthology¹ is one of the most successful initiatives of the Association for Computational Linguistics (ACL), a society for people working on problems involving natural language and computation. The Anthology was initiated by Steven Bird (2008) and is now maintained by Min-Yen Kan. It includes all papers published by ACL and related organizations, as well as the Computational Linguistics journal, over a period of four decades.
The ACL Anthology has a major limitation: it is just a collection of papers. It does not include any citation information or any statistics about the productivity of the various researchers who contributed papers to it. To address this, we embarked on an ambitious initiative to manually annotate the entire Anthology and curate the ACL Anthology Network (AAN).²
1 http://www.aclweb.org/anthology-new/
2 http://clair.si.umich.edu/anthology/
D. R. Radev · P. Muthukrishnan · V. Qazvinian (✉) · A. Abu-Jbara
Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, USA
e-mail: [email protected]
The ACL anthology network corpus
Table 1 Statistics of the AAN 2011 release

Number of papers                   18,290
Number of authors                  14,799
Number of venues                   341
Number of paper citations          84,237
Citation network diameter          22
Collaboration network diameter     15
Number of citing sentences         77,753
AAN was started in 2007 by our group at the University of Michigan (Radev et al. 2009a, b). AAN provides citation and collaboration networks of the articles included in the ACL Anthology (excluding book reviews). AAN also includes rankings of papers and authors based on their centrality statistics in the citation and collaboration networks, as well as the citing sentences associated with each citation...
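To make the three networks concrete, the following is a minimal sketch, not the AAN implementation, of how an author citation network and an author collaboration network can be derived from paper-level data, and how a diameter like those in Table 1 can be computed. The toy papers, the function names, the decision to keep self-citations, and the definition of diameter as the longest shortest path among connected nodes are all assumptions made for illustration; this section does not specify how AAN handles these details.

```python
from collections import defaultdict, deque
from itertools import combinations

# Hypothetical toy data (not from AAN): paper id -> authors,
# and (citing paper, cited paper) pairs.
PAPER_AUTHORS = {
    "P1": ["A", "B"],
    "P2": ["B", "C"],
    "P3": ["D"],
}
PAPER_CITATIONS = [("P2", "P1"), ("P3", "P1"), ("P3", "P2")]


def incoming_citations(paper_citations):
    """Count how many times each paper is cited."""
    counts = defaultdict(int)
    for _, cited in paper_citations:
        counts[cited] += 1
    return dict(counts)


def author_citation_edges(paper_authors, paper_citations):
    """Project paper citations onto authors.

    Every author of the citing paper gets a directed edge to every
    author of the cited paper. Self-citations are kept (an assumption).
    """
    weights = defaultdict(int)
    for citing, cited in paper_citations:
        for src in paper_authors[citing]:
            for dst in paper_authors[cited]:
                weights[(src, dst)] += 1
    return dict(weights)


def collaboration_edges(paper_authors):
    """Undirected co-authorship edges, weighted by number of joint papers."""
    weights = defaultdict(int)
    for authors in paper_authors.values():
        for a, b in combinations(sorted(set(authors)), 2):
            weights[(a, b)] += 1
    return dict(weights)


def diameter(undirected_edges):
    """Longest shortest path, in hops, among connected nodes.

    Runs a breadth-first search from every node; unreachable pairs are
    ignored, so a disconnected graph still yields a finite value.
    """
    adjacency = defaultdict(set)
    for a, b in undirected_edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    longest = 0
    for start in adjacency:
        distance = {start: 0}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbor in adjacency[node]:
                if neighbor not in distance:
                    distance[neighbor] = distance[node] + 1
                    queue.append(neighbor)
        longest = max(longest, max(distance.values()))
    return longest


if __name__ == "__main__":
    print(incoming_citations(PAPER_CITATIONS))
    print(author_citation_edges(PAPER_AUTHORS, PAPER_CITATIONS))
    collaborations = collaboration_edges(PAPER_AUTHORS)
    print(collaborations, diameter(collaborations))
```

Running all-pairs breadth-first search is adequate for a toy graph; for a network with tens of thousands of nodes, a graph library would be the more practical choice.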