Venice Time Machine Explained

The Venice Time Machine is a large international project launched by the École Polytechnique Fédérale de Lausanne (EPFL) and the Ca' Foscari University of Venice in 2012 that aims to build a collaborative multidimensional model of Venice by creating an open digital archive of the city's cultural heritage covering more than 1,000 years of evolution.[1] The project aims to trace circulation of news, money, commercial goods, migration, artistic and architectural patterns amongst others to create a Big Data of the Past.[2] Its fulfillment would represent the largest database ever created on Venetian documents.[3] The project is an example of the new area of scholar activity that has emerged in the Digital Age: Digital Humanities.

The project's widespread critical acclaim led to the submission of a European counterpart proposal to the European Commission in April 2016.[4] The Venice Time Machine forms the technological basis of the proposed European Time Machine.[5]

The first full reconstruction of Venice showing the evolution of the city between 900 and 2000 was shown at the Venice Biennale of Architecture in 2018.[6] The Venice Time Machine model of the city of Venice in 1750 was also used for an exhibition at the Grand Palais in Paris in September 2018.[7]

Organisation and funding

The Venice Time Machine Project was launched by EPFL and the Ca' Foscari University of Venice in 2012. It includes collaboration from major Venetian patrimonial institutions: the State Archive in Venice, The Marciana Library, The Instituto Veneto and the Cini Foundation. The project is currently supported by the READ (Recognition and Enrichment of Archival Documents) European project, the SNF project Linked Books and ANR-SNF Project GAWS. The international board includes renowned scholars from Stanford, Columbia, Princeton, and Oxford. In 2014, The Lombard Odier Foundation joined the project Venice Time Machine as a financial partner.[8]

Technology and tools

The State Archives of Venice contain a massive amount of hand-written documentation in languages evolving from medieval times to the 20th century. An estimated 80 km of shelves are filled with over a thousand years of administrative documents, from birth registrations, death certificates and tax statements, all the way to maps and urban planning designs. These documents are often very delicate and are occasionally in a fragile state of conservation. The diversity, amount and accuracy of the Venetian administrative documents are unique in Western history. By combining this mass of information, it is possible to reconstruct large segments of the city's past: complete biographies, political dynamics, or even the appearance of buildings and entire neighborhoods.

Scanning

Paper documents are turned into high-resolution digital images with the help of scanning machines. Different types of documents impose various constraints on the type of scanning machines that can be used and on the speed at which a document can be scanned. In partnership with industry, EPFL is working on a semi-automatic, robotic scanning unit capable of digitizing about 1000 pages per hour. Multiple units of this kind will be built to create an efficient digitization pipeline adapted to ancient documents. Another solution currently being explored at EPFL involves scanning books without turning the pages at all. This technique uses X-ray synchrotron radiation produced by a particle accelerator.[9]

Transcription

The graphical complexity and diversity of hand-written documents make transcription a daunting task. For the Venice Time Machine, scientists are currently developing novel algorithms that can transform images into probable words. The images are automatically broken down into sub-images that potentially represent words. Each sub-image is compared to other sub-images, and classified according to the shape of word it features. Each time a new word is transcribed, it allows millions of other word transcripts to be recognized in the database.

Text processing

The strings of probable words are then turned into possible sentences by a text processor. This step is accomplished by using, among other tools, algorithms inspired by protein structure analysis that can identify recurring patterns.

Connecting data

The real wealth of the Venetian archives lies in the connectedness of its documentation. Several keywords link different types of documents, which makes the data searchable. This cross-referencing of imposing amounts of data organizes the information into giant graphs of interconnected data. Keywords in sentences are linked together into giant graphs, making it possible to cross-reference vast amounts of data, thereby allowing new aspects of information to emerge.

The Digital Humanities Laboratory of EPFL announced on 1 March 2016 the development of REPLICA, a new search engine for the study and enhanced use of the Venetian cultural heritage to be online by the end of 2016.[10]

Reception

Praise

Criticism

Other consequences

See also

External links

Notes and References

  1. Web site: In Brief – VTM.
  2. 10.1038/546341a. 28617482. The 'time machine' reconstructing ancient Venice's social networks. Nature. 546. 7658. 341–344. 2017. Abbott. Alison. 2017Natur.546..341A. free.
  3. Book: Kaplan. Frédéric. Proceedings of the 2015 ACM Symposium on Document Engineering . The Venice Time Machine . 7114931. 2015. 73. 10.1145/2682571.2797071. 9781450333078.
  4. Web site: Kaplan. Frederic. Venice Time Machine Flagship. European Commission. 9 May 2017. 29 April 2016.
  5. Book: Kaplan. Frédéric. Proceedings of the 2015 ACM Symposium on Document Engineering . The Venice Time Machine . 7114931. 2015. 73. 10.1145/2682571.2797071. 9781450333078.
  6. http://padiglionevenezia.it/casi/venice-time-machine/
  7. Web site: Eblouissante Venise!.
  8. Web site: Partners – VTM.
  9. Margaritondo. Giorgio. Kaplan. Frédéric. Hwu. Yeukuang. Peccenini. Eva. Stampanoni. Marco. Albertin. Fauzia. X-Ray Spectrometry. X-ray spectrometry and imaging for ancient administrative handwritten documents. 2015. 44. 3. 93–98. 10.1002/xrs.2581. 2015XRS....44...93A. 93245100 .
  10. Replica. 22 July 2019.
  11. Kaplan. Frédéric. Bornet. Cyril. Buntinx. Vincent. Studying Linguistic Changes over 200 Years of Newspapers through Resilient Words Analysis. Frontiers in Digital Humanities. 2017. 4. 2. 10.3389/fdigh.2017.00002. free.
  12. Book: Gardiner. Eileen. Musto. Ronald G.. The digital humanities : a primer for students and scholars. 2015. Cambridge University Press. New York, N.Y.. 978-1-107-01319-3. 149.
  13. de Montjoye. Yves-Alexandre. Hidalgo. Cesar A.. Verleysen. Michel. Blondel. Vincent D.. Unique in the crowd: The privacy bounds of human mobility. Scientific Reports. 2013. 3. 1376. 10.1038/srep01376. 23524645. 3607247. 2013NatSR...3E1376D.
  14. Fairfield. Joshua. Stein. Hannah. 145698329. Big Data, Big Problems: Emerging Issues in the Ethics of Data Science and Journalism. Journal of Mass Media Ethics. 2014. 29. 38–51. 10.1080/08900523.2014.863126.
  15. Liu. Alan. Thomas III. William G.. Humanities in the Digital Age. Inside Higher Ed. 2012.