Bibliographic coupling

Summary

Bibliographic coupling, like co-citation, is a similarity measure that uses citation analysis to establish a similarity relationship between documents. Bibliographic coupling occurs when two works reference a common third work in their bibliographies. It is an indication that a probability exists that the two works treat a related subject matter.[1]

Two documents are bibliographically coupled if they both cite one or more documents in common. The "coupling strength" of two given documents is higher the more citations to other documents they share. The figure to the right illustrates the concept of bibliographic coupling. In the figure, documents A and B both cite documents C, D and E. Thus, documents A and B have a bibliographic coupling strength of 3 - the number of elements in the intersection of their two reference lists.

Similarly, two authors are bibliographically coupled if the cumulative reference lists of their respective oeuvres each contain a reference to a common document, and their coupling strength also increases with the citations to other documents that their share. If the cumulative reference list of an author's oeuvre is determined as the multiset union of the documents that the author has co-authored, then the author bibliographic coupling strength of two authors (or more precisely, of their oeuvres) is defined as the size of the multiset intersection of their cumulative reference lists, however.[2]

Bibliographic coupling can be useful in a wide variety of fields, since it helps researchers find related research done in the past. On the other hand, two documents are co-cited if they are both independently cited by one or more documents.

History edit

The concept of bibliographic coupling was introduced by M. M. Kessler of MIT in a paper published in 1963,[3] and has been embraced in the work of the information scientist Eugene Garfield.[4] It is one of the earliest citation analysis methods for document similarity computation and some have questioned its usefulness, pointing out that two works may reference completely unrelated subject matter in the third. Furthermore, bibliographic coupling is a retrospective similarity measure,[5] meaning the information used to establish the similarity relationship between documents lies in the past and is static, i.e. bibliographic coupling strength cannot change over time, since outgoing citation counts are fixed.

The co-citation analysis approach introduced by Henry Small and published in 1973 addressed this shortcoming of bibliographic coupling by considering a document's incoming citations to assess similarity, a measure that can change over time. Additionally, the co-citation measure reflects the opinion of many authors and thus represents a better indicator of subject similarity.[6]

In 1972 Robert Amsler published a paper[7] describing a measure for determining subject similarity between two documents by fusing bibliographic coupling and co-citation analysis.[8]

In 1981 Howard White and Belver Griffith introduced author co-citation analysis (ACA).[9] Not until 2008 did Dangzhi Zhao and Andreas Strotmann combine their work and that of M. M. Kessler to define author bibliographic coupling analysis (ABCA), noting that as long as authors are active this metric is not static and that it is particularly useful when combined with ACA.[2]

More recently, in 2009, Gipp and Beel introduced a new approach termed Co-citation Proximity Analysis (CPA). CPA is based on the concept of co-citation, but represents a refinement to Small's measure in that CPA additionally considers the placement and proximity of citations within a document's full-text. The assumption is that citations in closer proximity are more likely to exhibit a stronger similarity relationship.[10]

In summary, a chronological overview of citation analysis methods includes:

  • Bibliographic coupling (1963)
  • Co-citation analysis (published 1973)
  • Amsler measure (1972)
  • Author co-citation analysis (1981)
  • Author bibliographic coupling analysis (2008)
  • Co-citation proximity analysis (CPA) (2009)

Applications edit

Online sites that make use of bibliographic coupling include The Collection of Computer Science Bibliographies Archived 2011-06-07 at the Wayback Machine and CiteSeer.IST

See also edit

Notes edit

  1. ^ Martyn, J (1964). "Bibliographic coupling". Journal of Documentation. 20 (4): 236. doi:10.1108/eb026352.
  2. ^ a b Zhao, D.; Strotmann, A. (2008). "Evolution of research activities and intellectual influences in information science 1996–2005: Introducing author bibliographic-coupling analysis". Journal of the American Society for Information Science and Technology. 59 (13): 2070–2086. doi:10.1002/asi.20910.
  3. ^ "Bibliographic coupling between scientific papers," American Documentation 24 (1963), pp. 123-131.
  4. ^ See for example "Multiple Independent Discovery and Creativity in Science," Current Contents, Nov. 3, 1980, pp. 5-10, reprinted in Essays of an Information Scientist, vol. 4 (1979-80), pp. 660-665.
  5. ^ Garfield Eugene, 2001.From Bibliographic Coupling to Co-Citation Analysis via Algorithmic Historio-Bibliography presented at Drexel University, Philadelphia, PA
  6. ^ Henry Small, 1973. "Co-citation in the scientific literature: A new measure of the relationship between two documents" Archived 2012-12-02 at the Wayback Machine. Journal of the American Society for Information Science (JASIS), volume 24(4), pp. 265-269. doi = 10.1002/asi.4630240406
  7. ^ Robert Amsler, Dec. 1972 "Applications of citation-based automatic classification", Linguistics Research Center, University Texas at Austin, Technical Report 72-14.
  8. ^ Class Amsler written by Bruno Martins and developed by the XLDB group of the Department of Informatics of the Faculty of Sciences of the University of Lisbon in Portugal
  9. ^ White, Howard D.; Griffith, Belver C. (1981). "Author Cocitation: A Literature Measure of Intellectual Structure". Journal of the American Society for Information Science. 32 (3): 163–171. doi:10.1002/asi.4630320302.
  10. ^ Bela Gipp and Joeran Beel, 2009 Citation Proximity Analysis (CPA) – A new approach for identifying related work based on Co-Citation Analysis in Proceedings of the 12th international conference on scientometrics and informetrics (issi’09), Rio de Janeiro (Brazil), 2009, pp. 571-575.

References edit

Bibliographic Coupling edit

  • Kessler, M. M. (1963). "Bibliographic coupling between scientific papers". American Documentation. 14 (1): 10–25. doi:10.1002/asi.5090140103.
  • Kessler, M. M. (1963). "An experimental study of bibliographic coupling between technical papers". IEEE Transactions on Information Theory. 9 (1): 49. doi:10.1109/tit.1963.1057800.

Author Bibliographic Coupling edit

  • Zhao, D.; Strotmann, A. (2008). "Evolution of research activities and intellectual influences in information science 1996–2005: Introducing author bibliographic-coupling analysis". Journal of the American Society for Information Science and Technology. 59 (13): 2070–2086. doi:10.1002/asi.20910.

Co-citation analysis edit

  • Small, Henry (1973). "Co-citation in the scientific literature: a new measure of the relationship between two documents". Journal of the American Society for Information Science. 24 (4): 265–269. doi:10.1002/asi.4630240406. S2CID 17845928.
  • Small, Henry; Griffith, B. C. (1974). "The structure of scientific literatures (I) Identifying and graphing specialties". Science Studies. 4 (1): 17–40. doi:10.1177/030631277400400102. S2CID 146684402.
  • Griffith, B. C.; et al. (1974). "The structure of scientific literatures (II) Towards a macro- and micro-structure for science". Science Studies. 4 (4): 339–365. doi:10.1177/030631277400400402. S2CID 145811357.
  • Collins, H. M. (1974). "The TEA set: Tacit knowledge and scientific networks". Science Studies. 4 (2): 165–186. doi:10.1177/030631277400400203. S2CID 26917303.

Co-citation Proximity Analysis (CPA) edit

  • Bela Gipp, (Co-)Citation Proximity Analysis – A Measure to Identify Related Work, Feb., 2006. Doctoral Proposal, VLBA-Lab, Otto-von-Guericke University, Magdeburg, Supervisor: Prof. Claus Rautenstrauch
  • Gipp, Bela; Beel, Joeran (2006). "Citation Proximity Analysis (CPA) – A New Approach for Identifying Related Work Based on Co-Citation Analysis" (PDF). Proceedings of the 12th International Conference on Scientometrics and Informetrics (ISSI'09). Rio de Janeiro, Brazil, 2009.
  • Gipp, Bela; Taylor, Adriana; Beel, Joeran (2010). "Link Proximity Analysis - Clustering Websites by Examining Link Proximity" (PDF). In Lalmas M.; Jose J.; Rauber A.; Sebastiani F.; Frommholz I. (eds.). Research and Advanced Technology for Digital Libraries. ECDL 2010. Lecture Notes in Computer Science. Vol. 6273. Springer.

Author Co-citation Analysis (ACA) edit

  • White, H. D.; Griffith, B. C. (1981). "Author co-citation: a literature measure of intellectual structure". Journal of the American Society for Information Science. 32 (3): 163–171. doi:10.1002/asi.4630320302.
  • McCain, K. W. (1986). "Co-cited author mapping as a valid representation of intellectual structure". Journal of the American Society for Information Science. 37 (3): 111–122. doi:10.1002/(sici)1097-4571(198605)37:3<111::aid-asi2>3.0.co;2-d.
  • Culnan, M. J. (1987). "Mapping the intellectual structure of MIS, 1980-1985: A co-citation analysis". MIS Quarterly. 11 (3): 341–353. doi:10.2307/248680. JSTOR 248680.
  • McCain, K. W. (1990). "Mapping authors in intellectual space: a technical overview". Journal of the American Society for Information Science. 41 (6): 433–443. doi:10.1002/(sici)1097-4571(199009)41:6<433::aid-asi11>3.0.co;2-q.
  • Hoffman, D. L.; Holbrook, M. B. (1993). "The intellectual structure of consumer research: A bibliometrics study of author co-citations in the first 15 years of the journal of consumer research". Journal of Consumer Research. 19 (4): 505–517. doi:10.1086/209319.
  • Eom, S. B. (1996). "Mapping the intellectual structure of research in decision support systems through author cocitation analysis (1971-1993)". Decision Support Systems. 16 (4): 315–338. doi:10.1016/0167-9236(95)00026-7.

Citation Studies in a More General Context edit

  • Small, Henry (1978). "Cited Documents as Concept Symbols" (PDF). Social Studies of Science. 8 (3): 327–340. doi:10.1177/030631277800800305. S2CID 145538259.
  • Henry Small (1982). "Citation context analysis." In: Brenda Dervin and M. J. Voigt, eds., Progress in Communication Sciences, volume 3, pp. 287–310. Ablex Publishing, 1982.
  • Blair, David C.; Maron, M. E. (1985). "An evaluation of retrieval effectiveness for a full-text document-retrieval system". Communications of the ACM. 28 (3): 289–299. doi:10.1145/3166.3197. hdl:2027.42/35415. S2CID 5144091.
  • Brin, Sergey; Page, Lawrence (1998). "The anatomy of a large-scale hypertextual Web search engine". Computer Networks and ISDN Systems. 30 (1–7): 107–117. CiteSeerX 10.1.1.115.5930. doi:10.1016/s0169-7552(98)00110-x. S2CID 7587743.
  • He, Yulan; Cheung Hui, Siu (2002). "Mining a web citation database for author co-citation analysis". Information Processing and Management. 38 (4): 491–508. doi:10.1016/s0306-4573(01)00046-2.
  • Bradshaw, Shannon (2003). "Reference Directed Indexing: Redeeming Relevance for Subject Search in Citation Indexes". Research and Advanced Technology for Digital Libraries. Lecture Notes in Computer Science. Vol. 2769. pp. 499–510. doi:10.1007/978-3-540-45175-4_45. ISBN 978-3-540-40726-3.
  • Ritchie, Anna; Teufel, Simone; Robertson, Stephen (2006). "Creating a test collection for citation-based IR experiments". Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics -. pp. 391–398. doi:10.3115/1220835.1220885. S2CID 16879847.
  • Iwayama, Makoto; Fujii, Atsushi; Kando, Noriko; Marukawa, Yozo (2006). "Evaluating patent retrieval in the third NTCIR workshop". Information Processing and Management. 42 (1): 207–221. doi:10.1016/j.ipm.2004.08.012.
  • Fujii, Atsushi (2007). "Enhancing patent retrieval by citation analysis". Proceedings of the 30th Annual international ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '07. pp. 793–794. doi:10.1145/1277741.1277912. ISBN 9781595935977. S2CID 12433507.
  • Strohman, Trevor; Croft, W. Bruce; Jensen, David (2007). "Recommending citations for academic papers". Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '07. pp. 705–706. doi:10.1145/1277741.1277868. ISBN 9781595935977. S2CID 11304924.
  • Ritchie, Anna; Robertson, Stephen; Teufel, Simone (2008). "Comparing citation contexts for information retrieval". Proceedings of the 17th ACM Conference on Information and Knowledge Mining - CIKM '08. pp. 213–222. doi:10.1145/1458082.1458113. ISBN 9781595939913. S2CID 15585395.
  • Schwarzer, Malte; Schubotz, Moritz; Meuschke, Norman; Breitinger, Corinna; Markl, Volker; Gipp, Bela (2016). "Evaluating Link-based Recommendations for Wikipedia" (PDF). Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries - JCDL '16. pp. 191–200. doi:10.1145/2910896.2910908. ISBN 9781450342292. S2CID 2597308.

Further reading edit

For an interesting summary of the progression of the study of citations see.[1] The paper is more a memoir than a research paper, filled with decisions, research expectations, interests and motivations—including the story of how Henry Small approached Belver Griffith with the idea of co-citation and they became collaborators, mapping science as a whole.

External links edit

  • CITREC, an evaluation framework for citation-based similarity measures including Bibliographic Coupling, Co-citation, Co-citation Proximity Analysis and others.[2]
  • Jeppe Nicolaisen, Bibliographic coupling in Birger Hjørland, ed., Core Concepts in Library and Information Science
  1. ^ Small, Henry (2001). "Belver and Henry". Scientometrics. 51 (3): 489–497. doi:10.1023/a:1019690918490. S2CID 5962665.
  2. ^ Bela Gipp, Norman Meuschke & Mario Lipinski, 2015. "CITREC: An Evaluation Framework for Citation-Based Similarity Measures based on TREC Genomics and PubMed Central" in Proceedings of the iConference 2015, Newport Beach, California, 2015.