Metadata Extraction Tool - Introduction PDF

The Metadata Extraction Tool was developed by the National Library of New Zealand to automatically extract preservation metadata from a variety of file formats like PDFs, images, sound files, and Microsoft Office documents. It outputs the metadata in a standard XML format for use in preservation activities. The tool supports over a dozen file formats and can extract technical metadata as well as metadata embedded in files. It has both a graphical user interface and a command line interface to allow for batch processing or individual file processing. The open source tool is written in Java and XML and its code can be extended by developers.

Uploaded by

Freddie P

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

208 views

Metadata Extraction Tool - Introduction PDF

Uploaded by

Freddie P

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

10/24/2017 Metadata Extraction Tool - Introduction

Metadata Extraction Tool

Home Introduction

The Metadata Extraction Tool was developed by the National Library of New Zealand to programmatically extract preservation metadata
Information Sheet from a range of file formats like PDF documents, image files, sound files Microsoft office documents, and many others.

Project page The tool was initially developed in 2003 and released as open source softtware in 2007. The current version can be downloaded from the
SourceForge download page.
Documentation
Purpose of the Metadata Extraction Tool
Screenshots The Tool builds on the Library's work on digital preservation, and its logical preservation metadata schema. It is designed to:

Download automatically extracts preservationrelated metadata from digital files
output that metadata in a standard format (XML) for use in preservation activities.
Bugs
The Tool was designed for preservation processes and activities, but can be used to for other tasks, such as the extraction of metadata for
resource discovery.
Contact
Supported File Formats

The Metadata Extract Tool includes a number of 'adapters' that extract metadata from specific file types. Extractors are currently provided
for:

Images: BMP, GIF, JPEG and TIFF.
Office documents: MS Word (version 2, 6), Word Perfect, Open Office (version 1), MS Works, MS Excel, MS PowerPoint, and PDF.
Audio and Video: WAV, MP3 (normal and with ID3Tags), BFW, FLAC.
Markup languages: HTML and XML.
Internet files: ARC

If a file type is unknown the tool applies a generic adapter, which extracts data that the host system 'knows' about any given file (such as
size, filename, and date created).

Capabilities

The tool has both a Microsoft Windows interface and a UNIX command line interface. This enables work to be automated through batch
processing or processed on an individual basis as required.

The application opens all files as readonly, ensuring the integrity of original files. The tool only reads header information, so the extraction
process is quick.

Open Source Development

The Tool is written in Java and XML and is distributed under the Apache Public License (version 2).

Developers may be interested in extending some of the key components of the Metadata Extraction Tool such as extending existing
adapters or developing new ones to process other file types, or creating new XSLT files to generate different XML output formats.

Please refer to Developers Guide for more information on these components.

http://meta-extractor.sourceforge.net/ 1/1

ICDL Computer Essentials
From Everand
ICDL Computer Essentials
Michael Anderson
4/5 (2)
IPC6325 WD VR Configuration Guide
50% (2)
IPC6325 WD VR Configuration Guide
194 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
C# for Beginners: Learn in 24 Hours
From Everand
C# for Beginners: Learn in 24 Hours
Alex Nordeen
No ratings yet
MadCap Flare for Programmers
From Everand
MadCap Flare for Programmers
Thomas Tregner
5/5 (1)
Mastering the Microsoft Deployment Toolkit
From Everand
Mastering the Microsoft Deployment Toolkit
Jeff Stokes
No ratings yet
SRS - How to build a Pen Test and Hacking Platform
From Everand
SRS - How to build a Pen Test and Hacking Platform
alasdair gilchrist
2/5 (1)
Best Free Open Source Data Recovery Apps for Mac OS English Edition
From Everand
Best Free Open Source Data Recovery Apps for Mac OS English Edition
Cyber Jannah Sakura
No ratings yet
Metadata Extraction Tool Changes
No ratings yet
Metadata Extraction Tool Changes
4 pages
PC Essentials | Learn Basic Computing
From Everand
PC Essentials | Learn Basic Computing
Nolo Nob
No ratings yet
Python File Handling Made Easy: A Practical Guide with Examples
From Everand
Python File Handling Made Easy: A Practical Guide with Examples
William E. Clark
No ratings yet
Software Design And Development in your pocket
From Everand
Software Design And Development in your pocket
David Chen
5/5 (1)
Steps to Technology: Terms and Concepts For Beginners
From Everand
Steps to Technology: Terms and Concepts For Beginners
Ahmed Mosalam
No ratings yet
Software Suite: Revolutionizing Computer Vision with the Ultimate Software Suite
From Everand
Software Suite: Revolutionizing Computer Vision with the Ultimate Software Suite
Fouad Sabry
No ratings yet
Image Collection Exploration: Unveiling Visual Landscapes in Computer Vision
From Everand
Image Collection Exploration: Unveiling Visual Landscapes in Computer Vision
Fouad Sabry
No ratings yet
HackerTools Crack With Disassembling
From Everand
HackerTools Crack With Disassembling
Omega Brdarevic
2.5/5 (3)
C++ File Handling Step by Step: A Practical Guide with Examples
From Everand
C++ File Handling Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
Config File Types
From Everand
Config File Types
Frank Wellington
No ratings yet
Changes v3 0
No ratings yet
Changes v3 0
3 pages
WiX: A Developer's Guide to Windows Installer XML
From Everand
WiX: A Developer's Guide to Windows Installer XML
Ramirez Nick
No ratings yet
Efficient Workflows with Notepad++: Definitive Reference for Developers and Engineers
From Everand
Efficient Workflows with Notepad++: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Metadata Extraction
No ratings yet
Metadata Extraction
83 pages
Free Opensource Office Suite Software Apps For Windows 11 OS
From Everand
Free Opensource Office Suite Software Apps For Windows 11 OS
Cyber Jannah Sakura
No ratings yet
Operating Systems: Concepts to Save Money, Time, and Frustration
From Everand
Operating Systems: Concepts to Save Money, Time, and Frustration
Jonathan Rigdon
No ratings yet
.Net Framework and Programming in ASP.NET
From Everand
.Net Framework and Programming in ASP.NET
Priyanka Agarwal
No ratings yet
Best Free Open Source Office Software For Windows 10 Bilingual Edition English Germany
From Everand
Best Free Open Source Office Software For Windows 10 Bilingual Edition English Germany
Cyber Jannah Sakura
No ratings yet
Touchpad Modular Ver. 1.1 Class 7: Windows 7 & MS Office 2010
From Everand
Touchpad Modular Ver. 1.1 Class 7: Windows 7 & MS Office 2010
Team Orange
No ratings yet
How To Create An App
From Everand
How To Create An App
Duong Tran
3/5 (8)
How To Program A Mobile Game
From Everand
How To Program A Mobile Game
Duong Tran
4/5 (1)
Java File Handling Step by Step: A Practical Guide with Examples
From Everand
Java File Handling Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
From Everand
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
Jonathan Rigdon
No ratings yet
Free Antivirus And Antimalware Software For Ubuntu & Linux Mint
From Everand
Free Antivirus And Antimalware Software For Ubuntu & Linux Mint
Cyber Jannah Studio
No ratings yet
Python Automation for Beginners: A Practical Guide with Examples
From Everand
Python Automation for Beginners: A Practical Guide with Examples
William E. Clark
No ratings yet
Free Open Source Linux OS For Data Recovery & Data Rescue Bilingual Version Ultimate
From Everand
Free Open Source Linux OS For Data Recovery & Data Rescue Bilingual Version Ultimate
Cyber Jannah Sakura
No ratings yet
USB Mass Storage: Designing and Programming Devices and Embedded Hosts
From Everand
USB Mass Storage: Designing and Programming Devices and Embedded Hosts
Jan Axelson
No ratings yet
Dataflow and Reactive Programming Systems
From Everand
Dataflow and Reactive Programming Systems
Matt Carkci
No ratings yet
Python Programming: Learn, Code, Create
From Everand
Python Programming: Learn, Code, Create
Sachin Naha
No ratings yet
XML Programming: The Ultimate Guide to Fast, Easy, and Efficient Learning of XML Programming
From Everand
XML Programming: The Ultimate Guide to Fast, Easy, and Efficient Learning of XML Programming
Christopher Right
2.5/5 (2)
Computer Applications: The Beginner's Guide
From Everand
Computer Applications: The Beginner's Guide
Edafe
No ratings yet
Angular Workshop: From Beginner to Pro, Creating Applications for the Real World
From Everand
Angular Workshop: From Beginner to Pro, Creating Applications for the Real World
Abdelfattah Ragab
No ratings yet
Essential Python 3
From Everand
Essential Python 3
Kevin Vans-Colina
No ratings yet
INI Format Explained
From Everand
INI Format Explained
Isabella Ramirez
No ratings yet
Building an Operating System with Rust: A Practical Guide
From Everand
Building an Operating System with Rust: A Practical Guide
Robert Johnson
No ratings yet
20 Windows Tools Every SysAdmin Should Know
From Everand
20 Windows Tools Every SysAdmin Should Know
padmin
5/5 (2)
“Information Systems Unraveled: Exploring the Core Concepts”: GoodMan, #1
From Everand
“Information Systems Unraveled: Exploring the Core Concepts”: GoodMan, #1
Patrick Mukosha
No ratings yet
List Anti Rootkit & AntiVirus For Ubuntu, Linux & BSD (Edition 2018)
From Everand
List Anti Rootkit & AntiVirus For Ubuntu, Linux & BSD (Edition 2018)
Muhammad Vandestra
No ratings yet
Linux 5 Day Introduction Course
From Everand
Linux 5 Day Introduction Course
Stephen Edwards
No ratings yet
Linux Services Deployment
From Everand
Linux Services Deployment
Fabian Mestre
No ratings yet
Concise Oracle Database For People Who Has No Time
From Everand
Concise Oracle Database For People Who Has No Time
Billy Aung Myint
No ratings yet
The 1 Page Python Book
From Everand
The 1 Page Python Book
Barani Kumar
2/5 (1)
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
Spring 2.5 Aspect Oriented Programming
From Everand
Spring 2.5 Aspect Oriented Programming
Massimiliano DessÃ¬
No ratings yet
Linux System Programming: From Basics to Expert Proficiency
From Everand
Linux System Programming: From Basics to Expert Proficiency
William Smith
No ratings yet
Python For Data Science
From Everand
Python For Data Science
Kevin Clark
No ratings yet
Rust for Beginners
From Everand
Rust for Beginners
Hernando Abella
No ratings yet
Mastering Python Programming: A Comprehensive Guide: The IT Collection
From Everand
Mastering Python Programming: A Comprehensive Guide: The IT Collection
Christopher Ford
5/5 (1)
Beginning XML
From Everand
Beginning XML
Joe Fawcett
3/5 (1)
Metadata Extraction Tool: Installation Guide
No ratings yet
Metadata Extraction Tool: Installation Guide
8 pages
Zorin OS Administration and User Guide: Definitive Reference for Developers and Engineers
From Everand
Zorin OS Administration and User Guide: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Blender Pro Studio Advanced Techniques for Real-World Projects: Blender, #3
From Everand
Blender Pro Studio Advanced Techniques for Real-World Projects: Blender, #3
Steven Mcananey
No ratings yet
Metadata Assistant Quick Start Guide
0% (1)
Metadata Assistant Quick Start Guide
7 pages
WAN Architectures and Design Principles BRKRST-2041
100% (1)
WAN Architectures and Design Principles BRKRST-2041
91 pages
Rakshasa Whitepaper
No ratings yet
Rakshasa Whitepaper
13 pages
Look at This Part 3 Question and The Three Answers Below. Which Answer Do You Think Is Best? Why?
No ratings yet
Look at This Part 3 Question and The Three Answers Below. Which Answer Do You Think Is Best? Why?
2 pages
Xmax Technology
100% (1)
Xmax Technology
9 pages
Fibeair Ip 20g
No ratings yet
Fibeair Ip 20g
2 pages
Barnali Home Automation Project
No ratings yet
Barnali Home Automation Project
44 pages
Apple Laptop Price List
No ratings yet
Apple Laptop Price List
130 pages
8050 Service Bulletins
No ratings yet
8050 Service Bulletins
132 pages
Linux Training
100% (1)
Linux Training
35 pages
Help The Bunny Find His Carrot!
No ratings yet
Help The Bunny Find His Carrot!
5 pages
What Is Koha?
100% (4)
What Is Koha?
26 pages
Multiple Access Protocols
No ratings yet
Multiple Access Protocols
54 pages
BRKSEC2004
No ratings yet
BRKSEC2004
214 pages
FortiOS 7.0.1 Administration Guide
No ratings yet
FortiOS 7.0.1 Administration Guide
2,150 pages
Dual Band Wi Fi Extender 600 User Guide
No ratings yet
Dual Band Wi Fi Extender 600 User Guide
6 pages
2638 Daikin Heat Pump Fault Codes LR Tcm219-196577
No ratings yet
2638 Daikin Heat Pump Fault Codes LR Tcm219-196577
2 pages
MPC LED Indication PDF
No ratings yet
MPC LED Indication PDF
2 pages
Cisco Actualtests 350-801 Exam Question 2022-Dec-06 by Ronald 78q Vce
No ratings yet
Cisco Actualtests 350-801 Exam Question 2022-Dec-06 by Ronald 78q Vce
8 pages
MA5616 Configuration Script
No ratings yet
MA5616 Configuration Script
5 pages
Module 10 Server Management
No ratings yet
Module 10 Server Management
25 pages
iPECS IP Phone - 1000i Series: Ericsson-LG Enterprise
No ratings yet
iPECS IP Phone - 1000i Series: Ericsson-LG Enterprise
26 pages
Statement of Purpose (Internetworking and Cyber Security) : Need Help With The Assignment?
No ratings yet
Statement of Purpose (Internetworking and Cyber Security) : Need Help With The Assignment?
2 pages
DCP b7535dw PDF
No ratings yet
DCP b7535dw PDF
2 pages
2023 24 ODD CE262 DCN Syllabus
No ratings yet
2023 24 ODD CE262 DCN Syllabus
5 pages
APD4 Install E RevU
No ratings yet
APD4 Install E RevU
88 pages
High Availability and Site Resilience in Exchange Server
No ratings yet
High Availability and Site Resilience in Exchange Server
5 pages
CEH Pentesting Tools: Sniffers
No ratings yet
CEH Pentesting Tools: Sniffers
5 pages
Author Registration Form
No ratings yet
Author Registration Form
2 pages
Pc202 Full Datasheet
No ratings yet
Pc202 Full Datasheet
97 pages

Metadata Extraction Tool - Introduction PDF

Uploaded by

Metadata Extraction Tool - Introduction PDF

Uploaded by

10/24/2017 Metadata Extraction Tool - Introduction

You might also like