Publishing with XML: Structure, enter, publish

Ebook454 pages2 hours

Publishing with XML: Structure, enter, publish

Name: Publishing with XML: Structure, enter, publish
Author: Ligaran
ISBN: 9782335086522

By Ligaran and Bernard Prost

Rating: 0 out of 5 stars

()

Read preview

About this ebook

XML is now at the heart of book publishing techniques: it provides the industry with a robust, flexible format which is relatively easy to manipulate. Above all, it preserves the future: the XML text becomes a genuine tactical asset enabling publishers to respond quickly to market demands. When new publishing media appear, it will be possible to very quickly make your editorial content available at a lower cost. On the downside, XML can become a bottomless pit for publishers attracted by its possibilities. There is a strong temptation to switch to audiovisual production and to add video and animation to what we currently call a book, i.e. a written, relatively linear discourse representing a series of ideas. Publishers cannot ignore technology, however. It is better to recognize the threats of innovation and to maintain your business and your convictions by boarding the e-publishing ship. But make sure you carry a life preserver, XML, to ride above the waves of modern times.

À PROPOS DES ÉDITIONS LIGARAN

Les éditions LIGARAN proposent des versions numériques de qualité de grands livres de la littérature classique mais également des livres rares en partenariat avec la BNF. Beaucoup de soins sont apportés à ces versions ebook pour éviter les fautes que l'on trouve trop souvent dans des versions numériques de ces textes.

LIGARAN propose des grands classiques dans les domaines suivants :

• Livres rares
• Livres libertins
• Livres d'Histoire
• Poésies
• Première guerre mondiale
• Jeunesse
• Policier

Skip carousel

LanguageEnglish

PublisherLigaran

Release dateJun 19, 2015

ISBN9782335086522

Author

Ligaran

Related authors

Skip carousel

Related to Publishing with XML

Related ebooks

Skip carousel

Handcraft Epub in 7 Steps
Ebook
Handcraft Epub in 7 Steps
byMundy Obilor Jim
Rating: 0 out of 5 stars
0 ratings
Learning HTML5 by Creating Fun Games
Ebook
Learning HTML5 by Creating Fun Games
byRodrigo Silveira
Rating: 4 out of 5 stars
4/5
Academic E-Books: Publishers, Librarians, and Users
Ebook
Academic E-Books: Publishers, Librarians, and Users
bySuzanne M. Ward
Rating: 3 out of 5 stars
3/5
Learning Adobe Muse
Ebook
Learning Adobe Muse
byJennifer Farley
Rating: 4 out of 5 stars
4/5
Visual Language for the World Wide Web
Ebook
Visual Language for the World Wide Web
byPaul Honeywill
Rating: 0 out of 5 stars
0 ratings
Principles of Web Design
Ebook
Principles of Web Design
byBrian D Miller
Rating: 0 out of 5 stars
0 ratings
Real-World Solutions for Developing High-Quality PHP Frameworks and Applications
Ebook
Real-World Solutions for Developing High-Quality PHP Frameworks and Applications
bySebastian Bergmann
Rating: 3 out of 5 stars
3/5
openFrameworks Essentials
Ebook
openFrameworks Essentials
byDenis Perevalov
Rating: 0 out of 5 stars
0 ratings
Mini Style Guide: An Introduction to Good Writing and Manuscript Presentation
Ebook
Mini Style Guide: An Introduction to Good Writing and Manuscript Presentation
byDenise O'Hagan
Rating: 0 out of 5 stars
0 ratings
Content Strategy: Connecting the dots between business, brand, and benefits
Ebook
Content Strategy: Connecting the dots between business, brand, and benefits
byRahel Anne Bailie
Rating: 0 out of 5 stars
0 ratings
PHP 5 CMS Framework Development - 2nd Edition
Ebook
PHP 5 CMS Framework Development - 2nd Edition
byMartin Brampton
Rating: 0 out of 5 stars
0 ratings
Every Page is Page One
Ebook
Every Page is Page One
byMark Baker
Rating: 3 out of 5 stars
3/5
Ebooks and Editors: What you need to know
Ebook
Ebooks and Editors: What you need to know
byKevin Callahan
Rating: 0 out of 5 stars
0 ratings
Ultimate Tailwind CSS Handbook: Build sleek and modern websites with immersive UIs using Tailwind CSS
Ebook
Ultimate Tailwind CSS Handbook: Build sleek and modern websites with immersive UIs using Tailwind CSS
byKartik Bhat
Rating: 0 out of 5 stars
0 ratings
Technical Writing for Business and Engineering Professionals
Ebook
Technical Writing for Business and Engineering Professionals
byAkram Najjar
Rating: 0 out of 5 stars
0 ratings
WordPress Bible
Ebook
WordPress Bible
byAaron Brazell
Rating: 3 out of 5 stars
3/5
Front Matter, Back Matter, and Metadata
Ebook
Front Matter, Back Matter, and Metadata
byPaul Salvette
Rating: 5 out of 5 stars
5/5
The Book Blueprint: Expert Advice for Creating Industry-Standard Print Books
Ebook
The Book Blueprint: Expert Advice for Creating Industry-Standard Print Books
byJoel Friedlander
Rating: 0 out of 5 stars
0 ratings
Optical Character Recognition: Fundamentals and Applications
Ebook
Optical Character Recognition: Fundamentals and Applications
byFouad Sabry
Rating: 0 out of 5 stars
0 ratings
World without history? Digital information is volatile: with it our culture can disappear but its preservation can save us
Ebook
World without history? Digital information is volatile: with it our culture can disappear but its preservation can save us
byStefano Cariolato
Rating: 0 out of 5 stars
0 ratings
WordPress 3 For Business Bloggers
Ebook
WordPress 3 For Business Bloggers
byPaul Thewlis
Rating: 5 out of 5 stars
5/5
The ebook factory: Strategies, ideas and operational instructions for creating income streams through writing and publishing an ebook
Ebook
The ebook factory: Strategies, ideas and operational instructions for creating income streams through writing and publishing an ebook
byStefano Calicchio
Rating: 0 out of 5 stars
0 ratings
Above the Fold: Understanding the Principles of Successful Web Site Design
Ebook
Above the Fold: Understanding the Principles of Successful Web Site Design
byBrian D Miller
Rating: 4 out of 5 stars
4/5
HTML5 Games: Creating Fun with HTML5, CSS3 and WebGL
Ebook
HTML5 Games: Creating Fun with HTML5, CSS3 and WebGL
byJacob Seidelin
Rating: 0 out of 5 stars
0 ratings
Scrolling: Unlocking the Visual World of Computer Vision
Ebook
Scrolling: Unlocking the Visual World of Computer Vision
byFouad Sabry
Rating: 0 out of 5 stars
0 ratings
Mastering Responsive Web Design with HTML5 and CSS3
Ebook
Mastering Responsive Web Design with HTML5 and CSS3
byRicardo Zea
Rating: 0 out of 5 stars
0 ratings
Content Based Image Retrieval: Unlocking Visual Databases
Ebook
Content Based Image Retrieval: Unlocking Visual Databases
byFouad Sabry
Rating: 0 out of 5 stars
0 ratings
Abbreviations and Signs A Primer of Information about Abbreviations and Signs, with Classified Lists of Those in Most Common Use
Ebook
Abbreviations and Signs A Primer of Information about Abbreviations and Signs, with Classified Lists of Those in Most Common Use
byFrederick W. (Frederick William) Hamilton
Rating: 0 out of 5 stars
0 ratings
How This Book Was Made & How You Can Make Your Own (NEW EDITION)
Ebook
How This Book Was Made & How You Can Make Your Own (NEW EDITION)
byMaria B. O'Hare
Rating: 0 out of 5 stars
0 ratings
New Business Models in the Digital Age
Ebook
New Business Models in the Digital Age
byJavier Celaya
Rating: 0 out of 5 stars
0 ratings

Programming For You

Skip carousel

Access 2019 Bible
Ebook
Access 2019 Bible
byMichael Alexander
Rating: 5 out of 5 stars
5/5
HTML, CSS, and JavaScript Mobile Development For Dummies
Ebook
HTML, CSS, and JavaScript Mobile Development For Dummies
byWilliam Harrel
Rating: 4 out of 5 stars
4/5
SQL All-in-One For Dummies
Ebook
SQL All-in-One For Dummies
byAllen G. Taylor
Rating: 3 out of 5 stars
3/5
PHP, MySQL, & JavaScript All-in-One For Dummies
Ebook
PHP, MySQL, & JavaScript All-in-One For Dummies
byRichard Blum
Rating: 5 out of 5 stars
5/5
Microsoft Publisher Guide to Success: Learn In A Guided Way How To Format your Page Layout and Graphic Design To Optimize Your Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #9
Ebook
Microsoft Publisher Guide to Success: Learn In A Guided Way How To Format your Page Layout and Graphic Design To Optimize Your Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #9
byKevin Pitch
Rating: 5 out of 5 stars
5/5
Coding for Kids Ages 9-15: Simple HTML, CSS and JavaScript lessons to get you started with Programming from Scratch
Ebook
Coding for Kids Ages 9-15: Simple HTML, CSS and JavaScript lessons to get you started with Programming from Scratch
byBob Mather
Rating: 5 out of 5 stars
5/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 5 out of 5 stars
5/5
JavaScript All-in-One For Dummies
Ebook
JavaScript All-in-One For Dummies
byChris Minnick
Rating: 5 out of 5 stars
5/5
Microsoft OneNote Guide to Success: Boost Your Productivity, Organize Your Notes & Ideas, and Manage Tasks Like a Pro
Ebook
Microsoft OneNote Guide to Success: Boost Your Productivity, Organize Your Notes & Ideas, and Manage Tasks Like a Pro
byKevin Pitch
Rating: 5 out of 5 stars
5/5
Unity from Zero to Proficiency (Foundations) Fifth Edition: Unity from Zero to Proficiency, #1
Ebook
Unity from Zero to Proficiency (Foundations) Fifth Edition: Unity from Zero to Proficiency, #1
byPatrick Felicia
Rating: 5 out of 5 stars
5/5
Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]
Ebook
Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]
byKevin Pitch
Rating: 5 out of 5 stars
5/5
Python Projects for Everyone
Ebook
Python Projects for Everyone
byMohamad Charara
Rating: 0 out of 5 stars
0 ratings
iPhone Made Simple for Seniors & Beginners – Full Color Visual Guide: Step-by-Step Instructions to Take Control & Stay Connected with Confidence
Ebook
iPhone Made Simple for Seniors & Beginners – Full Color Visual Guide: Step-by-Step Instructions to Take Control & Stay Connected with Confidence
byKevin Pitch
Rating: 5 out of 5 stars
5/5
iPhone 14 Guide for Seniors: Unlocking Seamless Simplicity for the Golden Generation with Step-by-Step Screenshots
Ebook
iPhone 14 Guide for Seniors: Unlocking Seamless Simplicity for the Golden Generation with Step-by-Step Screenshots
byKevin Pitch
Rating: 5 out of 5 stars
5/5
Microsoft Azure For Dummies
Ebook
Microsoft Azure For Dummies
byJack A. Hyman
Rating: 0 out of 5 stars
0 ratings
Responsive Web Design with HTML5 and CSS3 Essentials
Ebook
Responsive Web Design with HTML5 and CSS3 Essentials
byAsoj Talesra
Rating: 5 out of 5 stars
5/5
Python: Learn Python in 24 Hours
Ebook
Python: Learn Python in 24 Hours
byAlex Nordeen
Rating: 4 out of 5 stars
4/5
Learn PHP in 24 Hours
Ebook
Learn PHP in 24 Hours
byAlex Nordeen
Rating: 0 out of 5 stars
0 ratings
Learn SQL in 24 Hours
Ebook
Learn SQL in 24 Hours
byAlex Nordeen
Rating: 5 out of 5 stars
5/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
Hands-on DevOps with Linux: Build and Deploy DevOps Pipelines Using Linux Commands, Terraform, Docker, Vagrant, and Kubernetes (English Edition)
Ebook
Hands-on DevOps with Linux: Build and Deploy DevOps Pipelines Using Linux Commands, Terraform, Docker, Vagrant, and Kubernetes (English Edition)
byAlisson Machado de Menezes
Rating: 0 out of 5 stars
0 ratings
Modern C++ Programming Cookbook
Ebook
Modern C++ Programming Cookbook
byMarius Bancila
Rating: 5 out of 5 stars
5/5
Python Data Structures and Algorithms
Ebook
Python Data Structures and Algorithms
byBenjamin Baka
Rating: 5 out of 5 stars
5/5
Mastering JavaScript: The Complete Guide to JavaScript Mastery
Ebook
Mastering JavaScript: The Complete Guide to JavaScript Mastery
byTim Robards
Rating: 5 out of 5 stars
5/5
Learn SAP Basis in 24 Hours
Ebook
Learn SAP Basis in 24 Hours
byAlex Nordeen
Rating: 5 out of 5 stars
5/5
Mastering Deep Learning with Keras: From Basics to Expert Proficiency
Ebook
Mastering Deep Learning with Keras: From Basics to Expert Proficiency
byWilliam Smith
Rating: 0 out of 5 stars
0 ratings
Beginning Programming with C++ For Dummies
Ebook
Beginning Programming with C++ For Dummies
byStephen R. Davis
Rating: 4 out of 5 stars
4/5
Deep Reinforcement Learning: An Essential Guide
Ebook
Deep Reinforcement Learning: An Essential Guide
byRobert Johnson
Rating: 0 out of 5 stars
0 ratings
Microsoft SharePoint Guide to Success: Learn In A Guided Way How To Manage and Store Files to Optimize Your Organization, Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #10
Ebook
Microsoft SharePoint Guide to Success: Learn In A Guided Way How To Manage and Store Files to Optimize Your Organization, Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #10
byKevin Pitch
Rating: 5 out of 5 stars
5/5
HTML, CSS, & JavaScript All-in-One For Dummies
Ebook
HTML, CSS, & JavaScript All-in-One For Dummies
byPaul McFedries
Rating: 0 out of 5 stars
0 ratings

Related categories

Skip carousel

Reviews for Publishing with XML

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Publishing with XML - Ligaran

etc/frontcover.jpg

Bernard Prost

Publishing with XML

Ligaran Publishing

2015

EAN : 9782335086522

71100 Chalon-sur-Saône

FRANCE

Acknowledgments

Summarizing the relation between XML and publishing in a short book is a difficult task, and I could never have carried it out on my own. First I wish to thank some key people at Editions Eyrolles (the publisher of the French edition of this book): my editor Stéphanie Poisson and her team, as well as Véronique Dürr who helped her with the proofreading. They have the art of giving meaning to my thoughts which occasionally get overwhelmed by technology.

I also wish to thank all those who worked with me on XML:

– the shareholders of Ligaran: Alain Pierrot, a remarkable designer of advanced taxonomies, connoisseur of the Open Office suite, XSLT author, and an expert in book scanning; Xavier Maurin, the code and graphic wiz at MyBookForge.com, who has a brilliant view of the consumer digital world; Olivier Desnoux, a software developer with impeccable methodology, author of elegant (and legible!) code, co-designer of the MyBookForge transformation engine; Adrien Vieilleribière, talented researcher, major XSLT artist able to put just about anything online and make XML transformation to any format accessible to all, who also co-designed the MyBookForge transformation engine; Patrick Pierre, a talented engineer and one of the most advanced minds in publication technology—his mastery of IDML (barely discussed in this book) is remarkable; and Hugues Cochard, serial-creator of high-tech companies, currently in Tahiti but very present via the Web.

– all those who trusted me with their professional or scientific projects, notably Mai Nguyen and Lionel Ridoux who know everything about medication and XML.

– two friends met along the way: Christian Brugeron for his clever scripts designed to work around the limitations of just about any page layout software—starting with InDesign; and Benoît Leprince who provided various examples of InDesign layouts used to illustrate this book.

Thanks to all those in the brand new e-book ecosystem which should take off at an astounding rate worldwide and perhaps in France as well: notably to Houriah Ghebalou (PREMICE, the regional business incubator in Burgundy) who financed the preliminary research for the Ligaran/Mybookforge project; the Burgundy region, which supported the project and its local set-up; and Nicéphore Cité, our home away from home in Chalon-sur-Saône which assists image and audio start-ups.

Finally I would like to thank Ray Charles, who understood that the medium influences the message: without the need to flip the 45 RPM record to listen to the other half, the famous break in What I’d Say would not exist!

Foreword

Wait a minute, wait a minute, oh hold it! Hold it!

---

Hey (hey) ho (ho) hey (hey) ho (ho) hey (hey) ho (ho) hey

Ray Charles (What I’d Say)

If everything is under control, you are going too slow

Mario Andretti

The world of publishing is going through a sea change. Paper books are facing competition from an ever-expanding range of virtual devices: the Web (obviously); the compact, powerful, and aptly-named netbooks; and especially mobile phones and other nomadic devices like e-book readers and notepads which make complete professional and literary libraries available to all, urbi et orbi. Now publishers need to deliver content for these media, making use of their specific features while minimizing both costs and production lead times. At first publishers had to make several revisions of the same content for different target media. But today publishers are adopting a more industrial—yet also more standardized and restrictive—approach based on XML.

The flexible and universal nature of XML has attracted publishers—first and foremost those specializing in legal publications, who are used to working with SGML—as well as programmers, who can use the language to exchange data between a wide range of computer systems.

DEFINITION e-reader

A portable device for reading electronic books (e-books). An e-reader is a hardware device using display technology called e-paper, the marketing term for a non-backlit screen requiring minimal energy and reputedly less tiring for the eyes. Along with e-paper, marketers have coined the term e-ink to describe a pixel...

The Extensible Markup Language (XML), standardized in 1999, has reached maturity. An XML ecosystem has emerged populated by specialized software (XML editors), on-shore, near-shore and off-shore service providers specializing in the language; application developers able to use the Document Object Model (DOM) to create innovative electronic products, and industry-specific document models for various types of publications.

DEFINITION DOM

The Document Object Model is a tree-based IT model for XML or HTML documents. DOM is independent of all other taxonomies. The DOM enables programs to manipulate document components.

Nevertheless, XML usage has not yet stabilized and practices vary among publishers. The purpose of this book is to provide a practical overview of how publishers can use XML, based on concrete, tested methods which, by nature, are limited to specific cases. Publishing with XML is neither a bible nor a dogmatic treatise on the subject, and readers can adapt the examples provided to suit their needs.

How the book is organized

This book includes three parts—Structure, Enter, and Publish—covering the entire XML cycle for publishing an e-book. Publishing with XML is mainly intended for publishers, editors/proofreaders, and production managers. But it also addresses managers wishing to understand the underlying techniques, and to comprehend how the medium influences the design and format of digital publications. Authors curious to learn more about XML's possibilities can also discover new ways to design their composition.

The book frequently refers to a sample encyclopedia article, similar to those found in Wikipedia. The example is based on a structure developed specifically for this publication (

article_v1.2.dtd

). The example meets simple editorial requirements:

– be able to publish the article in paper format, on the Web, or on a smartphone.

– include interactive publication objects regarding authors, bibliographies, filmographies and discographies. The interactive features must be independent of target databases.

For simplicity's sake, this book does not contain tables or mathematical formulas (except for a few included as images).

Structuring with XML

The first chapter focuses on document modeling and the XML markup method. The following chapter describes the main structures found in a publication, or more generally in a document. Chapter Three shows how to write a DTD, i.e. the simplest way of representing a taxonomy.

DEFINITION Taxonomy

A set of tags used for encoding a document in XML. The taxonomy is usually written in a specialized language (such as DTD, XML Schema, or Relax NG).

Entering XML markup

Chapter Four concerns the actual entry of XML tags. In most cases, this job is outsourced, but publishers increasingly need to be able to modify a document using an XML editor in-house in order to correct minor errors or to make last-minute changes. This chapter focuses on configuring a commercial XML editor and using it with a specific DTD.

Chapter Five examines the relation with subcontractors: how to prepare the text to minimize errors when interpreting the structure, and how to create effective instructions.

Chapter Six discusses a step rarely described in the production process: proofing XML. It shows how to make sure the XML provided by the subcontractor meets the publisher's needs. This chapter also covers the various XML production models used for XML entry either before, during, or after the paper page layout.

Publishing

Chapter Seven provides an overview of the techniques for transforming an XML document into a target format, including XML itself (e.g. input for InDesign), XHTML, or any other text format. Although highly technical, there is nothing mysterious about the XSLT transformation language. It is important for those involved in publishing to understand the mechanism in order to appreciate the impact of editorial decisions.

Chapter Eight briefly describes publishing on electronic media, but limits the discussion to the Web, e-readers, and the iPhone (currently the most advanced phone-based e-reader).

Finally, Chapter Nine investigates two approaches to paper-based publishing using an XML document:

– directly transforming XML into PDF using XSL-FO, a page layout language written in XML

– directly importing XML using a DTP tool (such as InDesign)

This book provides the keys to using XML in the editing process, but presents only the bare essentials of this modern publishing method. Interested readers can find books dedicated to each of these techniques.

NOTE

XML terminology is relatively opaque. Many terms include references to SGML, style sheets, etc. but have lost their original meaning and the terms no longer reflect their actual role. You will need to apply them regardless of their usual meaning in English.

Chapter 1

Separating content from format

The crucial challenge for publishers is how to build a methodology for publishing across a wide range of current or future media, with a single markup process performed either before or after publication, and at the lowest possible cost. The first step in this process is to separate content from format, far beyond the techniques of word processor style sheets.

Modeling a document

A book, or more generally any document in XML format, requires a sufficiently general model adapted to all likely publishing scenarios. You create an abstract model for a set or a class of documents and then submit them to a common computer process.

Identifying the three aspects of a document

Once you become familiar with XML, you will never look at a document the same way. The content of a document is created by juxtaposing words (without any typographical enrichment) and the document's form (which partially highlights the author's thoughts). But the structure is a new document component providing features which depend specifically on the planned use of the paper and electronic editions.

The content

The content is the text, i.e. what you read; it is independent of the format. The version with the least amount of format is an audio recording: each word only has its semantic value and is not supported by any typographical variations, although a few audio variants can give a word more meaning.

The format

The format enhances the information. It is based on a highly cultural and linguistic graphical translation providing an implicit manner of interpreting the text.

In our society, putting a character in bold highlights it, both for titles and within the body of the text. The character font and the position on the page reflect the level of importance: text which is bigger and farther to the left is usually the highest-level title.

The structure

Actually I should say structures: there is not just one structure, but an infinite number of structures depending on what you wish to identify for future use.

– For a novel to be published in both paper and electronic editions, you simply identify the chapters, chapter titles, paragraphs, and the text to be highlighted within each paragraph.

– For a journal article to be published on the Web with automatic search functions in Google Scholar or Google Books (or any other bibliographic database), you mark entries in the bibliography, the authors' name, and the titles of publications or journals cited.

Figure 1-1 Content, Format, Structure

The content (on the left) is made of the raw text—what can be read out loud (audio book).

The format (on the right) provides additional information which is heavily influenced by culture and practices. A title appears larger and in bold. It acts both as a marker and a summary to help readers as they discover the text.

The structure—shown here via callouts—is an abstract representation (in many cases guided by a pre-existing form) intended for multimedia use, without making any choices in principle regarding the final appearance.

Identifying document classes

There is no such thing as a generic document model able to represent any type of document. If one did exist, it would be so complex that it would be impossible to use. Therefore we try to define document classes that correspond to various ways of organizing information—such as a dictionary—or to natural groups such as the collections of a given publisher.

The process of defining document classes, called document analysis, involves extracting the structural elements for future use from a set of similar documents. You usually start from a limited number of available and representative publications, and then gradually build a model meeting your multimedia editorial requirements.

Structured documents

The most basic structured document is a novel or a dissertation. This model is the simplest, the most widely used, the most intuitive, but also the most complex for there is an infinite number of structural variations to manage (even if it means ignoring or simplifying them for electronic editions).

DEFINITION Label

A graphical, textual, or numerical navigational indicator: numbering in a list, chapter numbers, etc.

A dissertation is often (but not always) divided into parts, which in turn are divided into chapters. Each chapter has an (optional) title, preceded by an (optional) number or label for positioning it in the book's organization. When the composition has neither chapter numbers nor a title, it is difficult to mark the chapters in an electronic edition. There are solutions, of course...

The most common structural component within a chapter is a paragraph: a semantic unit defined by the author and represented typographically by both an indentation on the first line—making it easy to see even if it appears at the start of a page—and a carriage return at the end. Within a paragraph, the author can highlight certain words or phrases using bold or italic font, for instance.

Finally, typographical variants related to a paragraph (such as flush right) express various concepts such as a quotation, an excerpt, an epigraph, etc. The number of possible variations is unlimited.

Dictionaries

Each dictionary has its own structure; hence it's not realistic to speak of THE dictionary class structure You will fin either specific structure to each dictionnary, the target being to publish different paper version (for example a paperback dictionary) or electronic versions with advanced features (hypertext link, lookup functions, etc.)

A dictionary is closer to a database structure than to a book structure. It has entries, often sorted alphabetically, organized in semantic units more or less like a data base.

Usually, entries are structured in XML and look more or less like micro-documents which are

Enjoying the preview?

Page 1 of 1

Publishing with XML: Structure, enter, publish

About this ebook

Ligaran

Related authors

Related to Publishing with XML

Related ebooks

Handcraft Epub in 7 Steps

Learning HTML5 by Creating Fun Games

Academic E-Books: Publishers, Librarians, and Users

Learning Adobe Muse

Visual Language for the World Wide Web

Principles of Web Design

Real-World Solutions for Developing High-Quality PHP Frameworks and Applications

openFrameworks Essentials

Mini Style Guide: An Introduction to Good Writing and Manuscript Presentation

Content Strategy: Connecting the dots between business, brand, and benefits

PHP 5 CMS Framework Development - 2nd Edition

Every Page is Page One

Ebooks and Editors: What you need to know

Ultimate Tailwind CSS Handbook: Build sleek and modern websites with immersive UIs using Tailwind CSS

Technical Writing for Business and Engineering Professionals

WordPress Bible

Front Matter, Back Matter, and Metadata

The Book Blueprint: Expert Advice for Creating Industry-Standard Print Books

Optical Character Recognition: Fundamentals and Applications

World without history? Digital information is volatile: with it our culture can disappear but its preservation can save us

WordPress 3 For Business Bloggers

The ebook factory: Strategies, ideas and operational instructions for creating income streams through writing and publishing an ebook

Above the Fold: Understanding the Principles of Successful Web Site Design

HTML5 Games: Creating Fun with HTML5, CSS3 and WebGL

Scrolling: Unlocking the Visual World of Computer Vision

Mastering Responsive Web Design with HTML5 and CSS3

Content Based Image Retrieval: Unlocking Visual Databases

Abbreviations and Signs A Primer of Information about Abbreviations and Signs, with Classified Lists of Those in Most Common Use

How This Book Was Made & How You Can Make Your Own (NEW EDITION)

New Business Models in the Digital Age

Programming For You

Access 2019 Bible

HTML, CSS, and JavaScript Mobile Development For Dummies

SQL All-in-One For Dummies

PHP, MySQL, & JavaScript All-in-One For Dummies

Microsoft Publisher Guide to Success: Learn In A Guided Way How To Format your Page Layout and Graphic Design To Optimize Your Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #9

Coding for Kids Ages 9-15: Simple HTML, CSS and JavaScript lessons to get you started with Programming from Scratch

Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence

JavaScript All-in-One For Dummies

Microsoft OneNote Guide to Success: Boost Your Productivity, Organize Your Notes & Ideas, and Manage Tasks Like a Pro

Unity from Zero to Proficiency (Foundations) Fifth Edition: Unity from Zero to Proficiency, #1

Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]

Python Projects for Everyone

iPhone Made Simple for Seniors & Beginners – Full Color Visual Guide: Step-by-Step Instructions to Take Control & Stay Connected with Confidence

iPhone 14 Guide for Seniors: Unlocking Seamless Simplicity for the Golden Generation with Step-by-Step Screenshots

Microsoft Azure For Dummies

Responsive Web Design with HTML5 and CSS3 Essentials

Python: Learn Python in 24 Hours

Learn PHP in 24 Hours

Learn SQL in 24 Hours

Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees

Hands-on DevOps with Linux: Build and Deploy DevOps Pipelines Using Linux Commands, Terraform, Docker, Vagrant, and Kubernetes (English Edition)

Modern C++ Programming Cookbook

Python Data Structures and Algorithms

Mastering JavaScript: The Complete Guide to JavaScript Mastery

Learn SAP Basis in 24 Hours

Mastering Deep Learning with Keras: From Basics to Expert Proficiency

Beginning Programming with C++ For Dummies

Deep Reinforcement Learning: An Essential Guide

Microsoft SharePoint Guide to Success: Learn In A Guided Way How To Manage and Store Files to Optimize Your Organization, Tasks & Projects, Surprising Your Colleagues And Clients: Career Elevator, #10

HTML, CSS, & JavaScript All-in-One For Dummies

Related categories

Reviews for Publishing with XML

What did you think?

Book preview

Publishing with XML - Ligaran

Acknowledgments

Foreword

How the book is organized

Structuring with XML

Entering XML markup

Publishing

Separating content from format

Modeling a document