DIGITISATION OF MARATHI MANUSCRIPTS

Similar documents
Digitization, Digital Preservation, Rare Manuscripts, Museums and Documents Centre of Astan Quds Razavi Library, Iran

DIGITIZATION OF PUNE UNIVERSITY BULLETIN: A CASE STUDY

The CYCU Chang Ching Yu Memorial Library Resource Development Policy

from physical to digital worlds Tefko Saracevic, Ph.D.

Digitization : Basic Concepts

Digital Preservation of Rare Books & Manuscripts: A Case Study of Aligarh Muslim University

Internship Report. Project

Conference of Directors of National Libraries in Asia and Oceania Annual meeting of 2018 at the National Library of Myanmar (Naypyitaw), Myanmar

Myanmar Country Report to CDNL-AO 2011

Instructions to Authors

Date Effected May 20, May 20, 2015

WESTERN PLAINS LIBRARY SYSTEM COLLECTION DEVELOPMENT POLICY

NLI Update Elhanan Adler, Marina Goldsmith

2009 CDNLAO COUNTRY REPORT

JAMAICA. Planning and development of audiovisual archives in Jamaica. by Anne Hanford. Development of audiovisual archives

New Challenges : digital documents in the Library of the Friedrich-Ebert-Foundation, Bonn Rüdiger Zimmermann / Walter Wimmer

LIBRARY AND INFORMATION SERVICES COLLECTION DEVELOPMENT GUIDELINES FOR SPECIAL COLLECTIONS

Open Source Software for Arabic Citation Engine: Issues and Challenges

Searching for Special Collections Material in idiscover

Publication Policy and Guidelines for Authors

Collection Development Policy J.N. Desmarais Library

INFS 427: AUTOMATED INFORMATION RETRIEVAL (1 st Semester, 2018/2019)

Global Memory Net Offers New Innovative Access to Tsurumi s Old Japanese Waka Poems and Tales, and Maps

Library on Gender and Equality & Historical Archive of the General Secretariat for Gender Equality of Greece (Ministry of the Interior)

Welsh print online THE INSPIRATION THE THEATRE OF MEMORY:

CHAPTER V SUMMARY OF FINDINGS, SUGGESTIONS AND CONCLUSION

COLLECTION DEVELOPMENT POLICY

Your Research Assignment: Searching & Citing

The multicultural-scope of the services offered by the Miguel de Cervantes digital library project.

Instructions to Authors

Comparing gifts to purchased materials: a usage study

CITATION ANALYSES OF DOCTORAL DISSERTATION OF PUBLIC ADMINISTRATION: A STUDY OF PANJAB UNIVERSITY, CHANDIGARH

Development of Classical Tamil Digital Library: CIIL Experience. Abstract

ARAB REPUBLIC. Introduction of Machine-Readable Cataloguing at the National Information and Documentation Centre. SeppoVuorinen

SAMPLE COLLECTION DEVELOPMENT POLICY

I. GENERAL OVERVIEW OF RECENT MAJOR DEVELOPMENTS AND RELATIONSHIP TO GOVERNMENT

Purpose Aims Objectives... 2

"Libraries - A voyage of discovery" Connecting to the past newspaper digitisation in the Nordic Countries

Collection Development Policy

Collection Management Policy

Instructions to Authors

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It

Excerpt of the new core provisions. Article 1. Amendment of the Act on Copyright and Related Rights

Digitization Project of the Historical Archives of Macao

This presentation does not include audiovisual collections that are in possession

Chaining Sources in Social Science Research. Chaim Kaufmann February 1, 2007

Policy on Donations. The Library s Collection Development Strategy is to acquire such materials as

Collection Development Duckworth Library

RFID BASED LIBRARY MANAGEMENT SECURITY SYSTEM Shushant Kumar Singh, Avinow Raj, ShahinaFirdoush, and ShrutiKriti

Música a la llum : the Access to Music Archives IAML project adapted to the wind bands of the region of Valencia

INFO 665. Fall Collection Analysis of the Bozeman Public Library

EAP269: Preliminary survey of Arabic manuscripts in Djenne, Mali, with a view to a major project of preservation, digitisation and cataloguing

Mercy International Association. Standards for Mercy Archives

Faculty Governance Minutes A Compilation for online version

The Korean Film Archive

Electronic Records in Maine. Presented by Nina M. Osier, Director Division of Records Management Services Maine State Archives May 20, 2008

Western library practices: a few ways to build connections with library users, and why!

Conway Public Library

GUIDE TO SERVICES OF THE DAR LIBRARY

Japan Library Association

COLLECTION DEVELOPMENT AND MANAGEMENT POLICY BOONE COUNTY PUBLIC LIBRARY

Digital Humanities from the Ground Up: The Tamil Digital Heritage Project at the National Library, Singapore

Akron-Summit County Public Library. Collection Development Policy. Approved December 13, 2018

1. Controlled Vocabularies in Context

La Porte County Public Library Collection Development Policy

DIGITISATION GUIDELINES

Annual Report of the IFLA-PAC China Center

3 Year B.A./B.Sc. (Honours) in Library and Information Studies UNIVERSITY OF CALCUTTA. 1 P age

1/29/2008. Announcements. Announcements. Announcements. Announcements. Announcements. Announcements. Project Turn-In Process. Quiz 2.

Announcements. Project Turn-In Process. Project 1A: Project 1B. and URL for project on a Word doc Upload to Catalyst Collect It

Manual and Guidelines. For. Library Automation Software Version

Citation Analysis of Dissertations of Law Submitted to University of Delhi

Instructions to Authors

7 - Collection Management

Malaysian E Commerce Journal

Department of American Studies M.A. thesis requirements

Date Revised: October 2, 2008, March 3, 2011, May 29, 2013, August 27, 2015; September 2017

LIBRARY & ARCHIVES MANAGEMENT PRACTICE COLLECTION MANAGEMENT

ISO 2789 INTERNATIONAL STANDARD. Information and documentation International library statistics

Collection Development Policy Western Illinois University Libraries

Automation of Processes in the National Library of China: Historical Review and Future Perspective

Biometric Voting system

Automated Cataloging of Rare Books: A Time for Implementation

The CLA HE Trial Scanning Licence how we re using it.

Data Representation. signals can vary continuously across an infinite range of values e.g., frequencies on an old-fashioned radio with a dial

AIATSIS Library Collection Development Policy

Newsletter January 2017

Principal version published in the University of Innsbruck Bulletin of 4 June 2012, Issue 31, No. 314

COUNTRY REPORT. National Library of Cambodia for the CDNLAO Meeting on 7. May.2007

The Art of finding an illustration or just Google it!

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

Library Resources for Faculty

Instructions to Authors

O P E R A T I O N M A N U A L. RF-Reader. Stand-alone-Reader Leser 2plus with RS-232 interface

OUR LIBRARY. Used by scientists, lecturers, experts, students and citizens. The special multidiscipline library of the Bulgarian Academy of Sciences.

J. ISSN: The ISSN/EAN-13 barcode has the following components:

Author Guidelines for Preparing Manuscript: Manuscript file format

Electronic Publishing

Library 101. To find our online catalogue, Discover from the HSP home page, first see Collections then Catalogues and Research Tools.

IRIS Online Catalog Handbook

Transcription:

DIGITISATION OF MARATHI MANUSCRIPTS By Dr. ( Mrs.) N.J. Deshpande* Mr. B.M. Panage** ABSTRACT The present paper discusses the importance of manuscript collection and the necessity to preserve it for the future research and references. It highlights on the existing precious manuscript collection of Jayakar library. The paper tries to convert document in the digital form to preserve and maintain it. More emphasis is given on conversion of Marathi manuscripts available in the Jayakar library. It also discusses various methods of capturing data and its advantages and disadvantages. The central theme of the paper is to find out, how much space is required to store one image in various file formats like bitmap, tif and jpeg, which may act as a guideline for preservation and maintenance of manuscripts. * Librarian and Head, Dept. of LIS, Jayakar Library, University of Pune, Pune - 411 007. E-mail : njd@lib.unipune.ernet.in ** Assistant Librarian, Jayakar Library, University of Pune, Pune - 411007. E-mail : bmpanage@rediffmail.com 0. Introduction Manuscripts are precious part of our cultural heritage. Great ancient scholars dedicated their lives for creating written records of knowledge. India's most valued and revered gift to humanity is its profound and timeless heritage. This heritage encompasses almost every aspect of human enquiry, exploration and existence covering philosophy, religion, language, literature, metaphysics, art and dance and so on. Today this heritage is scattered in texts in libraries and in individual positions. This precious gift is slowly decaying and vanishing due to the improper handling. Indeed, preservation of this heritage presents a great challenge before us. However, fortunately the information technology is offering many solutions not only for preservation, but also for enhancement and for its wide scale access. Knowing the fact that the treasure of manuscripts still lying scattered in the nooks and corners of India, Dr. Raghavan suggested University Grant Commission to appoint a manuscript committee. Accordingly, U.G.C. appointed a manuscript committee under the chairmanship of Dr.V. Raghvan in 1959. This committee visited many places and found that the preservation and other type of work related to manuscripts was not satisfactory and made following suggestion, " unless a systematic policy for the collection, preservation and utilization of manuscripts was pursued in right way, there is a danger of

manuscripts being taken out of the country by foreign agencies, as also destruction because of ignorance and negligence." Many efforts are made to preserve the manuscripts and nowadays, many agencies have carried out digitisation projects. Some projects are completed, while some are at the initial stage. It is found that many Indic manuscripts are available in foreign universities and they have developed their digitized collection of manuscripts. One of such universities is University of Pennsylvania. 1. Manuscript Collection of Jaykar Library Jayakar library has a very rich collection of manuscripts and at present this library holds around 5000 manuscripts. The present collection of manuscripts is enriched by acquiring it from various parts of India, These manuscripts are purchased or received as donations during the last five decade. This collection extends over important branches of learning and contains titles of many unpublished work. Jayakar Library has published the descriptive catalogue of manuscripts available in its collection, which contains Sanskrit, Marathi and Hindi manuscripts. There are about 700 Marathi manuscripts, out of which three collections are eminent, namely Ketkar collection, Dattavarada Vitthala and Mahanubhava collection.. 2. Digitisation (definition) "Digitisation refers to the process of translating a piece of information such as a book, sound recording, picture or video into bits. Bits are the fundamental units of information in a computer system. Turning information into these binary digits is called digitisation." Digitisation is one of the hot topics in librarianship today. To build a 'digital library' requires that the content of a collection be available electronically. The rhetoric of the information highway has provided the impetus to convert many existing paper-based ( or sound, video) collection into new digital media. The assumption is that digital collections will be more accessible to a broader range of users, presumably through networking techniques, and new efficiencies are to be gained in resource sharing and for preservation. 3. Digitisation Process Digitisation requires a basic process, which involves different sets of hardware and software technologies at each step. Determining the appropriate technology is directly linked to the anticipated use and purpose of the material being digitized. For digitizing the text and other material, following five methods can be used. a) Manual data entry Scanning b) Optical character recognition ( OCR) c) Excalibur Technologies and pattern recognition technologies d) Document imaging

a) The simplest method of converting an image of a page (or the real page of text) into digital text is to enter it manually. This is usually a time consuming method but very useful from the point of view of information retrieval. b) In the second method, scanners are used to take digital pictures of objects. Scanners can be simple desk top machines or very large and complex systems that process thousands of documents. c) Another simple digitization process is of OCR i.e. scanning printed pages to build a digital database of text. This process uses OCR (Optical Character Recognition) software, which takes a picture of the page and then turns it into digital text, which can be edited or fully indexed. OCR software must distinguish between black and white areas of text. d) Excalibur Technologies and Pattern Recognition Technologies are the next generation of OCRs, represented by Pixie, a product being developed by Excalibur Technologies. This software uses a technology called Adaptive Pattern Recognition, which attempts to mimic aspects of the neural patterns of the brain. The software can be taught to recognize variations and relationships in pattern, such as patterns of text rather than readable text. The retrieval of search terms uses what Excalibur calls "fuzzy matching". e) Document Imaging, a simple method of capturing text, involves taking an electronic picture of each page of text with the same type of scanner as one would use for OCR. However, the difference is that the images are stored as graphic files rather than text files. A similar technology is used for fax transmission. Each page is stored as one picture. The text on the page cannot be edited or indexed. 4. Methodology This paper discusses in detail, how to convert the Marathi manuscripts available in Jayakar Library into digital form. One of the aspect of digitization is preservation and the authors have tried to preserve the manuscripts with the help of first and second methods which are mentioned above. Following three steps are used for digitization of manuscript :- 1. Scanning the original manuscript folios and preserving the image. 2. Making manual data entry using Marathi software. 3. Preparing an index and translating the original text into English for foreign scholars. 5. Collection of Dattavarada Vitthala Dattavarada Vitthala ( Sake 1670-1720) was a Marathi poet, who lived 200 years ago. He was a contemporary of the Peshwas Nanasaheb and Madhavrao. Most of his works are unpublished. These manuscripts are discovered by the late Shri. Kashinatha Panduranga Parakhi. These manuscripts were evaluated by M.M. Datto Vaman Potdar

and proved to be a valuable material for research in Marathi. There are around 52 manuscripts which are available in the Jayakar library, out of which, first ten manuscripts are selected for this pilot study. The details of these manuscripts are shown in the table given on next page. Serial Number Accession Number Title 2081 2515 Adhyatma Ramayan Balakanda 2082 2533 Atmaprabhoda 2083 2524 ( Bhagavat) Gitasara 2084 2516 Bhagavata- Caturtha(4th) skanda tika 2085 2517 Bhagavata Pancama(5th) skanda tika 2086 2518 Bhagavata-Saptama(7th) skanda tika 2087 2552 Bhavanidasaka stotra caranavyatha, Martanda dasakastotra and other works 2088 2544 Dattatraya lilavigraha 2089 2541 Dhyanacaurhasi 2090 2542 Ganesh panchayatana Pancastaka stotra The scholar or user can access any one the manuscripts from the above table. He can select the manuscripts just by clicking on the accession number or on the title of the manuscripts. After clicking on the particular manuscripts, following information will be displayed on the screen. The description of theses manuscripts is based on the descriptive catalogue of manuscripts which are published by Jayakar Library, University of Pune. Dhyanacaurhasi Sr.No. 2089 Acc.No. 2541 Title Dhyanacaurhasi Author Dattavarada Vitthala Commentator Material Paper Script Devnagari Size in Cms. 23 x 13 Folios 19 Lines per Page 10 Letters per line 21 Extent C Condition and Age G Additional Particulars Further information can be accessed by folio number ( page number). i.e. if you want to see the information of folio number1, then just click on that folio. Folio 1 Folio 2 Folio 3 Folio 4 Folio 5 Folio 6 Folio 7 Folio 8 Folio 9 Folio 10 Folio 11 Folio 12 Folio 13 Folio 14 Folio 15 Folio 16 Folio 17 Folio 18 Folio 19

As soon as you select the folio number to see the information of that folio, you can see the original page of manuscript, which is scanned and will be displayed in the following manner Technical details :- The above image is scanned using the following details a) Resolution - 100 b) Sharpen level - low c) Pixels - 522 x 930 The total number of bytes required to store one page of manuscripts ( 23cm x 13cm) is 1,391 KB, if the image is saved in bit map structure. The same page requires 1,397 KB if it is saved using tif formats and 96 KB are required, if it is saved in jpeg format. In short, a single floppy disk (1.44MB) is required to store one image which is captured by the scanner. User can select any folio to see the original contents of the manuscript. The original image of the page, which is scanned using some standard scanner, will be displayed. And at the same time, English translation of the above folio will be displayed at the end. 6. Conclusion The UNESCO project entitled ' The Memory of the World' was built on the premise that the cultural society has the responsibility to preserve information about the history and make it available also for future generations. It aims to stimulate a responsible approach to the sources from which our historical consciousness grows and to contribute to the general availability of information about our history and culture. The abundance of information of average and below average quality generates paradoxically the demand for new, unusual, exotic and uneasily available information. This explains the growing interest in old manuscripts. However manuscripts are not available easily, so digitization is the solution for preservation and access of manuscripts. It is observed that at present, scanning is the suitable alternative for storing manuscripts. This maneuver will serve as a sort of guideline for the preservation of such type of manuscripts. Indeed, this is a challenging and promising task but one has to undertake such kind of activity which will not only help librarians, library professionals but the entire humanity as a whole! 7. References 1. Hampson, Andrew, " Managing a digitization project" Aslib managing information, 5: 10, December 1998. pp.25-32. 2. Hampson, Andrew, " Scanning in the right direction" Library Technology 4(6) November 1999. pp.79-80.

3. Kuny, Terry," An introduction to digitization technologies and issues" Network notes no.14, National Library of Canada, October 1, 1995. 4. Mahajan, S.G. " Descriptive catalogue of manuscripts available in the Jayakar Library, University of Pune, vol. I part II Marathi manuscripts 1986. 5. University Grant Commission, Manuscript Committee ( Chairman Dr. V. Raghvan), Manuscripts catalogues. Bangalore, 1963 p.4 6. University of Pennsylvani, Sanskrit manuscripts available at http://wwwlibrary.upenn.edu/etext/sasia/ski-mss/index.