XML Metadata: Dep't of Information Science: INSC2092
XML Metadata: Dep't of Information Science: INSC2092
XML Metadata: Dep't of Information Science: INSC2092
4 07/11/2022
XML cont’d…
<note>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
</note>
5 07/11/2022
XML cont’d…
6 07/11/2022
XML cont’d…
XML Simplifies Data Sharing:
In the real world, computer systems and databases contain
data in incompatible formats.
XML data is stored in plain text format. This provides a
software- and hardware-independent way of storing data.
This makes it much easier to create data that can be shared
by different applications.
7 07/11/2022
XML cont’d….
8 07/11/2022
XML cont’d…
9 07/11/2022
XML
cont’d…
10 07/11/2022
XML
cont’d…
11 07/11/2022
XML
cont’d…
12 07/11/2022
XML
cont’d…
13 07/11/2022
XML
cont’d…
14 07/11/2022
XML cont’d…
15 07/11/2022
XML cont’d…
<root>
<child>
<sub child>.....</sub child>
</child>
</root>
16 07/11/2022
XML cont’d…
17 07/11/2022
XML cont’d…
Entity References:
Some characters have a special meaning in XML.
If you place a character like "<" inside an XML element, it will
generate an error because the parser interprets it as the start
of a new element.
This will generate an XML error:
<message>if salary < 1000 then</message>
To avoid this error, replace the "<" character with an entity
reference:
<message>if salary < 1000 then</message>
There are 5 predefined entity references in XML:
< < less than > > greater than & & ampersand
' ' apostrophe " " quotation mark
18 07/11/2022
cont’d…
Comments in XML:
The syntax for writing comments in XML is similar
to that of HTML.
19 07/11/2022
cont’d…
20 07/11/2022
Metadata
What is metadata?
21 07/11/2022
Metadata cont’d…
22 07/11/2022
Metadata cont’d…
23 07/11/2022
Metadata cont’d…
24 07/11/2022
cont’d…
25 07/11/2022
cont’d…
26 07/11/2022
cont’d…
Benefits of Metadata:
27 07/11/2022
cont’d…
28 07/11/2022
cont’d…
For Organizations:
29 07/11/2022
Types of metadata
30 07/11/2022
cont’d…
31 07/11/2022
Functions of Metadata
Resource discovery :
Allowing resources to be found by relevant criteria;
Identifying resources;
Bringing similar resources together;
Distinguishing dissimilar resources;
Giving location information.
Organizing e-resources:
Organizing links to resources based on audience or topic.
Building these pages dynamically from metadata stored in
databases.
32 07/11/2022
cont’d…
Facilitating interoperability :
Using defined metadata schemes, shared transfer protocols,
and crosswalks between schemes, resources across the
network can be searched more seamlessly.
Cross-system search. e.g., using Z39.50 protocol. (remote)
Metadata harvesting. e.g. OAI protocol.
33 07/11/2022
cont’d…
Digital identification :
Elements for standard numbers, e.g., ISBN
The location of a digital object may also be given using:
a file name
a URL
some persistent identifiers, e.g., PURL (Persistent URL);
DOI (Digital Object Identifier) Combined metadata to act
as a set of identifying data, differentiating one object from
another for validation purposes.
34 07/11/2022
cont’d…
35 07/11/2022
Metadata Life Cycle
36 07/11/2022
cont’d…
Metadata Collection
Right metadata at the right time
Variety of collection strategies
Sources
potential sources of data for DW
external data
data structures
Data Models: enterprise data model start point
import from CASE tool
correlate enterprise and warehouse models
37 07/11/2022
cont’d…
Metadata Deployment:
Warehouse developers need:
physical structure information for data sources
enterprise data model
warehouse data model
concerned with accuracy, completeness and flexibility of
metadata
Need access to comprehensive impact analysis capabilities
Need to defend against accuracy & integrity questions
38 07/11/2022
Dublin Core
39 07/11/2022
Dublin Core Metadata Elements
40 07/11/2022
cont’d…
Source
Subject
Title
43 07/11/2022
cont’d…
44 07/11/2022
Characteristics of Dublin Core
Simplicity
Semantic Interoperability
International Consensus
Extensibility
45 07/11/2022
Simple Dublin Core
<metadata>
<dc:title>Cataloging cultural objects,</dc:title>
<dc:contributor>Baca, Murtha.</dc:contributor>
<dc:contributor>Harpring, Patricia./dc:contributor>
<dc:subject>Information organization</dc:subject>
<dc:subject>Metadata</dc:subject>
<dc:subject>Cultural property--Documentation</dc:subject>
<dc:subject>CC135.C37 2006</dc:subject>
<dc:subject>363.6</dc:subject>
<dc:date>2006</dc:date>
<dc:format>396 p.</dc:format>
<dc:type>Text</dc:type>
<dc:identifier>ISBN:0838935648</dc:identifier>
<dc:language>en</dc:language>
<dc:publisher>ALA Editions</dc:publisher>
</metadata>
46 07/11/2022
Qualified Dublin Core
<metadata>
<dc:title xml:lang="en">Cataloging cultural objects.</dc:title>
<dc:contributor>Baca, Murtha.</dc:contributor>
<dc:contributor>Harpring, Patricia.</dc:contributor>
<dc:subject xsitype="LCSH">Information organization</dc:subject>
<dc:subject xsitype="LCSH">Metadata</dc:subject>
<dc:subject xsitype="LCSH">Cultural property--Documentation</dc:subject>
<dc:subject xsitype="LCC">CC135.C37 2006</dc:subject>
<dc:subject xsitype="DDC">363.3</dc:subject>
<dc:date xsitype="W3CDTF">2006</dc:date>
<dcterms:extent>396 p.</dcterms:extent>
<dc:type xsitype="DCMIType">Text</dc:type>
<dc:identifier xsitype="URI">ISBN: 0838935648 </dc:identifier>
<dc:language xsitype="RFC3066">en</dc:language>
<dc:publisher>ALA Editions</dc:publisher>
<dcterms:audience>Catalogers</dcterms:audience>
</metadata>
47 07/11/2022
RDF
RDF(Resource Description Framework):- is a W3C standard for
describing Web resources, such as the title, author, modification
date, content, and copyright information of a Web page.
What is RDF?
It stands for Resource Description Framework.
is a framework for describing resources on the web
is designed to be read and understood by computers
is not designed for being displayed to people.
RDF descriptions are not designed to be displayed on the web.
is written in XML.
is a part of the W3C's Semantic Web Activity
is a W3C Recommendation.
48 07/11/2022
cont’d…
49 07/11/2022
RDF Rules:
50 07/11/2022
cont’d…
<?xml version="1.0"?>
<RDF>
<Description about="http://www.w3schools.com/rdf">
<author>Jan Egil Refsnes</author>
<homepage>http://www.w3schools.com</homepage>
</Description>
</RDF>
51 07/11/2022
RDF Statements:
52 07/11/2022
cont’d…
RDF Example:
Here are two records from a CD-list:
Empire
Bob Dylan USA Columbia 10.90 1985
Burlesque
Bonnie CBS
Hide your heart UK 9.90 1988
Tyler Records
53 07/11/2022
cont’d…
<?xml version="1.0"?>
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:cd="http://www.recshop.fake/cd#">
<rdf:Description
rdf:about="http://www.recshop.fake/cd/Empire Burlesque">
<cd:artist>Bob Dylan</cd:artist>
<cd:country>USA</cd:country>
<cd:company>Columbia</cd:company>
<cd:price>10.90</cd:price>
<cd:year>1985</cd:year>
</rdf:Description>
<rdf:Description
rdf:about="http://www.recshop.fake/cd/Hide your heart">
<cd:artist>Bonnie Tyler</cd:artist>
<cd:country>UK</cd:country>
<cd:company>CBS Records</cd:company>
<cd:price>9.90</cd:price>
<cd:year>1988</cd:year>
</rdf:Description>
</rdf:RDF>
07/11/2022
cont’d…
The first line of the RDF document is the XML declaration. The
XML declaration is followed by the root element of RDF
documents: <rdf:RDF>.
The xmlns:rdf namespace, specifies that elements with the rdf
prefix are from the namespace
"http://www.w3.org/1999/02/22-rdf-syntax-ns#".
The xmlns:cd namespace, specifies that elements with the cd
prefix are from the namespace "http://www.recshop.fake/cd#".
The <rdf:Description> element contains the description of the
resource identified by the rdf:about attribute.
The elements: <cd:artist>, <cd:country>, <cd:company>, etc.
are properties of the resource.
55 07/11/2022
FRBR
56 07/11/2022
cont’d…
Entity-Relationship Model
Entities: Group 1, 2, 3
Relationships
Attributes
User Tasks:
Find
Identify
Select
Obtain
Navigate
57 07/11/2022
cont’d…
58 07/11/2022
cont’d…
60 07/11/2022
cont’d…
61 07/11/2022
cont’d…
Relationships
Entity 1 Entity 2
62 07/11/2022