Digital Arabic Periodical Editions: presentation at Books in Motion

Till Grallert

2016-05-07

Majallat al-Muqtabas between gray online libraries, large-scale scanning efforts, and programming tools: producing fully open, collaborative, and scholarly editions of early Arabic periodicals.

Project URL: https://www.github.com/tillgrallert/digital-muqtabas

Project blog: https://tillgrallert.github.io/digital-muqtabas

Slides: https://tillgrallert.github.io/slides/BIM2016

Twitter: @tillgrallert

Email:

1. The journal of al-Muqtabas

al-Muqtabas / المقتبس

1.1 Importance of mundane texts / periodicals

1.2 A two-fold problem

The consequence is a focus on “high” culture and canonical texts

1.3 State of digitisation

  1. gray online libraries / “crowd”-sourced transcriptions, e.g. al-Maktaba al-Shāmila, Mishkāt, Ṣayd al-Fawāʾid, al-Waraq etc.
    • lack of / faulty metadata
    • unknown editing principles
    • unknown quality
    • very limited structural mark-up
    • cannot be reliably cited
  2. Digital imagery, e.g. Endangered Archives Programme (EAP), HathiTrust, Institut du Monde Arabe.
    • lack of metadata
    • restrictive licences, paywalls
    • no or very bad text layers

2. Suggested solution: unite facsimile and transcription

  1. aims
    • validate the transcription against the facsimiles
    • improve the transcription with the help of the “crowd”
    • make everything citable for scholars, linkable for machines
    • provide the new edition with the broadest possible licence to facilitate access and re-use
  2. principles
    • re-purpose available and established tools, technologies, and material
    • preference for open and simple formats and tools

3. Test case: digital Muqtabas

Web-view of al-Muqtabas 6(2)

TEI file of al-Muqtabas 6(2) in oXygen: author mode

TEI file of al-Muqtabas 6(2) in oXygen: plain XML

  1. Basis:
    • XML/TEI edition of all 96 issues (c. 7000 pages) of Muḥammad Kurd ʿAlī’s Majallat al-Muqtabas
    • The text links to open-access digital facsimiles
    • licensed under CC BY-SA 4.0
  2. Core feature:
    • social digital edition: gradually improve text and mark-up
  3. Sugar on top:
    • Static web-view (doesn’t require a permanent internet connection)
    • bibliographic metadata for all issues and articles (MODS, BibTeX)
    • access to bibliographic metadata through a public Zotero group
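
How the link between transcription and facsimile could be followed by a machine can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it supposes that page breaks are encoded as TEI `<pb>` elements whose `@facs` attribute points to a `<surface>` inside `<facsimile>`, which is a common TEI pattern but not confirmed here as the project's actual encoding. The sample file, its ids, and the example.org URLs are invented placeholders.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"
NS = {"tei": TEI_NS}

# Hypothetical miniature TEI file; ids and URLs are placeholders, not project data.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <facsimile>
    <surface xml:id="facs_1"><graphic url="https://example.org/images/page-1.jpg"/></surface>
    <surface xml:id="facs_2"><graphic url="https://example.org/images/page-2.jpg"/></surface>
  </facsimile>
  <text><body>
    <pb n="1" facs="#facs_1"/><p>...</p>
    <pb n="2" facs="#facs_2"/><p>...</p>
  </body></text>
</TEI>"""


def page_to_facsimile(tei_source):
    """Map each page number (pb/@n) to the URL of its facsimile image."""
    root = ET.fromstring(tei_source)

    # Index every surface by its xml:id and remember its first graphic URL.
    urls = {}
    for surface in root.iterfind(".//tei:surface", NS):
        graphic = surface.find("tei:graphic", NS)
        if graphic is not None:
            urls[surface.get(f"{{{XML_NS}}}id")] = graphic.get("url")

    # Resolve each page break's pointer (e.g. "#facs_1") against that index.
    pages = {}
    for pb in root.iterfind(".//tei:pb", NS):
        target = pb.get("facs", "").lstrip("#")
        if target in urls:
            pages[pb.get("n")] = urls[target]
    return pages


if __name__ == "__main__":
    for page, url in page_to_facsimile(SAMPLE_TEI).items():
        print(page, url)
```

A mapping of this kind is what would let a reader check any transcribed page against its image, and a script cite a page by a stable address.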

Project scheme

3.1 Basis: Generate the TEI edition

3.2 Core feature: Continuous improvement

A social and GitHub-hosted digital edition

  1. Improvements depending on human labour (probably a “crowd”)
    • correct the transcription
    • add structural mark-up
    • add semantic mark-up
  2. Automatic improvements:
    • provide reliable bibliographic metadata based on the facsimile
    • mark up named entities (e.g. personal names, toponyms) with links to external reference files
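
The automatic entity mark-up listed above could look roughly like the following Python sketch. It is a deliberately naive illustration, not the project's method: the function name `tag_toponyms`, the two-entry `GAZETTEER`, and its example.org URIs are all hypothetical, and the use of TEI `<placeName ref="...">` is an assumption about the target encoding. Exact string matching also ignores Arabic clitics and spelling variants, so real data would need more careful matching and human review.

```python
import re
from xml.sax.saxutils import escape, quoteattr

# Hypothetical reference list: surface form -> URI of an entry in an external file.
GAZETTEER = {
    "دمشق": "https://example.org/places/damascus",
    "بيروت": "https://example.org/places/beirut",
}


def tag_toponyms(text, gazetteer):
    """Wrap every known toponym in a <placeName> element pointing to its entry."""
    if not gazetteer:
        return escape(text)

    # Try longer names first so that a short name never splits a longer one.
    names = sorted(gazetteer, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(name) for name in names))

    parts = []
    last = 0
    for match in pattern.finditer(text):
        # Escape the untouched text between matches so the output stays valid XML.
        parts.append(escape(text[last:match.start()]))
        name = match.group(0)
        ref = quoteattr(gazetteer[name])  # quoteattr adds the surrounding quotes
        parts.append(f"<placeName ref={ref}>{escape(name)}</placeName>")
        last = match.end()
    parts.append(escape(text[last:]))
    return "".join(parts)


if __name__ == "__main__":
    print(tag_toponyms("سافر من دمشق إلى بيروت", GAZETTEER))
```

Because the result is plain XML in a text file, such automatic tagging can be committed like any other change and corrected later by human contributors.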

3.2 Core feature: how to contribute

Branches on GitHub

3.3 Sugar on top: web-view

Display of al-Muqtabas 6(2)

3.3 Sugar on top: Zotero group

Zotero group digital-muqtabas: list view

Zotero group digital-muqtabas: item view

3.4 Use cases: reviewed works

4. To do, ongoing work

5. Experiences: simple, fast, sustainable

Summary

Thank you!


  1. even in the US as attested to by HathiTrust