LIS Links

First and Largest Academic Social Network of LIS Professionals in India

PDF Metadata Extractor Information Needed

Dear Friends

We need information about a PDF metadata extractor, preferably one that is free and works offline. Please share details if anybody is using one. It is meant to be used together with DSpace, but we cannot put our collection online.

Any help will be highly appreciated.

Thank you

Subeesh A C


Replies to This Forum

Permalink Reply by Baskar Selvaraj on February 16, 2016 at 15:09

Try ExifTool

http://www.sno.phy.queensu.ca/~phil/exiftool/

I have been using it to extract metadata from PDFs for use in DSpace. If you are familiar with the command-line options, it can extract metadata from all PDFs in one go.

S. Baskar


Permalink Reply by Subeesh A C on February 17, 2016 at 0:25

Thank you very much sir

But in my trial, the tool seems to extract data only from the document properties. Are you getting the appropriate data with ExifTool?

Subeesh A C


Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:45

Hi,

Using the command below, you can extract all metadata (i.e. every metadata tag associated with each PDF document) from hundreds of PDF documents and save it as a CSV file, which can then be used for batch import into DSpace.

If you require only specific tags, you have to name the required metadata tags in the command. I have given an example below.

To extract all available metadata tags from the PDF documents and save them as a CSV file

---------------------------------------------------------------------------------------------------------------------

exiftool -csv *.pdf > output.csv

To extract specific metadata tags from the PDF documents and save them as a CSV file (list each tag as -TagName; tag names are not case sensitive)

-----------------------------------------------------------------------------------------------------------------------------

exiftool -csv -d %Y-%m-%d -Title -Author -Producer -Subject -Description -Type -Keywords -ISBN -CreateDate -CourseID -FileSize -PageCount -PDFVersion *.pdf > output.csv

Hope this helps.

S. Baskar

LinuXpert Systems
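
For anyone taking the CSV from the commands above towards a DSpace batch import, here is a minimal Python sketch of one possible next step: renaming the ExifTool columns to Dublin Core style field names. The field mapping and the function name exiftool_csv_to_dc are assumptions made for illustration, not something from the replies in this thread, so check the mapping against the metadata registry of your own DSpace installation. The only thing it relies on from ExifTool is that the CSV output has a SourceFile column followed by one column per tag.

```python
import csv
import io

# Hypothetical mapping from ExifTool column names to Dublin Core fields.
# Adjust it to the metadata registry of your own DSpace instance.
FIELD_MAP = {
    "Title": "dc.title",
    "Author": "dc.contributor.author",
    "Subject": "dc.subject",
    "CreateDate": "dc.date.created",
}


def exiftool_csv_to_dc(src, dst):
    """Copy rows from an ExifTool CSV stream to dst, renaming mapped columns."""
    reader = csv.DictReader(src)
    fieldnames = ["filename"] + list(FIELD_MAP.values())
    writer = csv.DictWriter(dst, fieldnames=fieldnames)
    writer.writeheader()
    for row in reader:
        out = {"filename": row.get("SourceFile") or ""}
        for exif_name, dc_name in FIELD_MAP.items():
            # Tags that a PDF does not carry come through as empty cells.
            out[dc_name] = (row.get(exif_name) or "").strip()
        writer.writerow(out)


if __name__ == "__main__":
    # Made-up sample rows standing in for a real output.csv.
    sample = io.StringIO(
        "SourceFile,Title,Author,CreateDate\n"
        "a.pdf,Annual Report,Library Cell,2015-06-01\n"
        "b.pdf,,,\n"
    )
    result = io.StringIO()
    exiftool_csv_to_dc(sample, result)
    print(result.getvalue())
```

To run it on a real export, open output.csv with open("output.csv", newline="", encoding="utf-8") and pass the file object in place of the sample.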


Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:54

ExifTool Tag Names

The tables listed below give the names of all tags recognized by ExifTool.

http://www.sno.phy.queensu.ca/~phil/exiftool/TagNames/index.html


Permalink Reply by Subeesh A C on February 20, 2016 at 19:27

Thank you very much sir


Permalink Reply by Mujib Rahiman K U on February 17, 2016 at 18:34

A few years ago I created a small utility for extracting information from PDF files. It extracts data from all files in a folder and saves it in a tab-delimited text file.

You can try it. I hope it helps. Please let me know.

I have uploaded the program to Google Drive. Click here to download

with regards

Mujib Rahiman

KV Kanjikode


Permalink Reply by Subeesh A C on February 20, 2016 at 19:28

Thanks sir, I will surely let you know.

Regards

Subeesh A C


Permalink Reply by Subeesh A C on February 22, 2016 at 23:38

Sir

I have checked your software; it is a great effort if you coded it yourself. As far as I can see, most software is not able to identify the metadata of PDF files in the way we require. I think the problem mostly revolves around the structure of the PDF files themselves. In my case the PDF files do not have any standard structure (plus OCR) from which an algorithm could extract metadata as it would from a properly structured file. Since we are in a hurry and require more metadata for the current work, we are thinking of indexing the files and filtering them later by category. Anyway, thanks for your reply.
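
One way to see how far the embedded metadata goes before deciding on manual indexing is to sort the files by whether the fields you need are filled in. The Python sketch below does that for a CSV produced by exiftool -csv. The choice of Title and Author as required fields and the function name split_by_completeness are assumptions for illustration only.

```python
import csv
import io

# Assumed minimum set of fields; change to whatever your workflow needs.
REQUIRED = ("Title", "Author")


def split_by_completeness(src, required=REQUIRED):
    """Return (complete, incomplete) lists of SourceFile names from an ExifTool CSV."""
    complete, incomplete = [], []
    for row in csv.DictReader(src):
        # A field counts as present only if the cell exists and is not blank.
        has_all = all((row.get(name) or "").strip() for name in required)
        target = complete if has_all else incomplete
        target.append(row.get("SourceFile") or "")
    return complete, incomplete


if __name__ == "__main__":
    # Made-up sample rows standing in for a real output.csv.
    sample = io.StringIO(
        "SourceFile,Title,Author\n"
        "a.pdf,Annual Report,Library Cell\n"
        "b.pdf,Scanned Volume,\n"
    )
    ok, todo = split_by_completeness(sample)
    print("usable as is:", ok)
    print("need manual metadata:", todo)
```

The files in the second list are the ones that would still need metadata entered or indexed by hand.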
