COCOA (digital humanities)

Summary

COCOA (an acronym derived from COunt and COncordance Generation on Atlas) was an early text file utility and associated file format for digital humanities, then known as humanities computing. The program comprised approximately 4000 punched cards of FORTRAN and was created in the late 1960s and early 1970s at University College London and the Atlas Computer Laboratory in Harwell, Oxfordshire. Its functionality included word counting and concordance building.[1][2][3][4]

Oxford Concordance Program

The format of the Oxford Concordance Program, which was developed at Oxford University Computing Services, was a direct descendant of COCOA. The Oxford Text Archive holds items in this format.[5]

Later developments

The COCOA file format bears a passing similarity to later markup languages such as SGML and XML. A noticeable difference from those successors is that COCOA tags are flat rather than tree-structured. Each tag sets the value of one type of information, and that value remains in force until a later tag of the same type changes it. Members of the Text Encoding Initiative community maintain legacy support for COCOA,[6][7] although most in-demand texts and corpora have already been migrated to more widely understood formats such as TEI XML.[8]
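
The flat, state-based behaviour of the tags can be illustrated with a minimal Python sketch. It is not the original FORTRAN program, and the tag shape `<CATEGORY value>`, the function name `annotate`, and the sample text are all assumptions made for illustration. The sketch keeps one current value per category and attaches the values in force to each line of text.

```python
import re

# Assumed tag shape for illustration only: <CATEGORY value>, e.g. <A Shakespeare>.
TAG = re.compile(r"<(\w+)\s+([^>]*)>")


def annotate(lines):
    """Pair each line of text with the tag values in force at that line.

    Simplification: a tag anywhere on a line is applied to the whole line.
    """
    state = {}  # one current value per category; there is no nesting
    out = []
    for line in lines:
        # A tag overwrites the previous value of the same category only.
        for category, value in TAG.findall(line):
            state[category] = value.strip()
        text = TAG.sub("", line).strip()
        if text:
            out.append((text, dict(state)))  # copy, so later tags do not alter it
    return out


# Hypothetical sample input.
sample = [
    "<A Shakespeare>",
    "<T Hamlet>",
    "To be, or not to be",
    "<T Macbeth>",
    "Tomorrow, and tomorrow",
]
result = annotate(sample)
for text, tags in result:
    print(tags, text)
```

In this sketch the second `<T ...>` tag replaces only the `T` value, while the `A` value set earlier continues to apply. No closing tag is needed, which is what distinguishes this flat model from the nested elements of SGML and XML.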

References

  1. ^ Paul E. Corcoran (November 1974). "COCOA: A FORTRAN Program for Concordance and Word-count Processing of Natural Language Texts". Behavior Research Methods & Instrumentation. 6 (6): 566. doi:10.3758/BF03201351.
  2. ^ Colin Day and Ian Marriott (February 1976). "Software Reviews: COCOA: A Word Count and Concordance Generator". Computers and the Humanities. 10 (1): 56. doi:10.1007/BF02399143. S2CID 198177017.
  3. ^ D. B. Russell (1965). "COCOA - A Word Count and Concordance Generator". Associates Technology Literature Applications Society. Retrieved 20 October 2013.
  4. ^ Susan Hockey. "The History of Humanities Computing". University of Illinois. Archived from the original on 18 September 2013. Retrieved 20 October 2013.
  5. ^ Gratian, 12th Cent (14 January 1987). "Concordia discordantium canonum ac primum de iure naturae et constitutionis". University of Oxford Text Archive. Retrieved 20 October 2013.
  6. ^ James Cummings, Sebastian Rahtz (2010). "This script is used to convert COCOA to TEI" (XSLT). Oxford University. Retrieved 3 April 2018.
  7. ^ "Stylesheets/Cocoa at dev · TEIC/Stylesheets". GitHub.
  8. ^ "Corpus Resource Database (CoRD)".