Freely available structures: XML Document Type Definitions you can use today

Organizations of all sizes are beginning to realize how content and its reuse across the enterprise can improve productivity and the bottom line. The need for change is driven by the desire to better manage information assets (documents, creative ideas, illustrations, charts, graphics, multimedia, etc.) and to eliminate costly processes that fail to support the effective and consistent reuse of content. At the heart of managing content for reuse, however, lies the job of exposing the underlying structure of that information.

The most significant way that structured documents differ from unstructured ones is that structured documents include “rules.” These rules formalize the order in which text, graphics, and tables may be entered into a document by an author. For example, in an unstructured document, a paragraph has specific formatting—font, size, and spacing. In a structured document, this same paragraph also has an exterior “wrapper” that governs the elements that are allowed to appear before and after it. The rules of these elements are often defined in a Document Type Definition (DTD).
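To make the idea of “rules” concrete, here is a minimal sketch in Python. The DTD fragment and the element names (section, title, para) are invented for this illustration and do not come from any of the DTDs discussed in this article. Python’s standard xml.etree.ElementTree module does not validate against DTDs, so the same rule is restated by hand as a pattern over the child element names; a real project would rely on a validating parser instead.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical DTD fragment, shown only for illustration: a section must
# contain exactly one title followed by one or more para elements.
SECTION_DTD = """
<!ELEMENT section (title, para+)>
<!ELEMENT title   (#PCDATA)>
<!ELEMENT para    (#PCDATA)>
"""

# The same content rule restated as a regular expression over child tag names.
# ElementTree does not validate against DTDs, so this check is hand-rolled.
SECTION_RULE = re.compile(r"title( para)+")


def follows_section_rule(xml_text: str) -> bool:
    """Return True if the root is a section whose children match (title, para+)."""
    root = ET.fromstring(xml_text)
    if root.tag != "section":
        return False
    child_tags = " ".join(child.tag for child in root)
    return SECTION_RULE.fullmatch(child_tags) is not None


valid_doc = "<section><title>Intro</title><para>First.</para><para>Second.</para></section>"
invalid_doc = "<section><para>Text before the title.</para><title>Intro</title></section>"

print(follows_section_rule(valid_doc))    # True
print(follows_section_rule(invalid_doc))  # False
```

The second document is rejected because its paragraph appears before the title, which is exactly the kind of ordering constraint an unstructured document cannot express.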

XML is a meta-markup language: a language used to create new markup languages. In the publishing arena, it’s most commonly used to create tag sets and the rules that govern them (DTDs), which describe structured content for presentation in text documents. It can also be used to describe, manage, and deliver content of all types (text, images, voice, forms, multimedia files, and so on) and to exchange transactional data between disparate database systems.

To learn more about XML and whether it’s the right choice for your organization, read “XML: The Answer To Everything?” in Free Software Magazine, Issue 1.

Darwin Information Typing Architecture (DITA) DTD

Managed by OASIS (Organization for the Advancement of Structured Information Standards), the Darwin Information Typing Architecture (DITA) 1.0 standard was released for public review in early 2005. DITA is designed for content creators, providing a framework for easily authoring online help, user guides, website content, and more. Content creators who use DITA author content in a topic-based tree, with information “typed” by using different subelements. The four main elements are topic, task, concept, and reference. The real beauty of DITA is that, unlike other DTDs, it provides an architecture that allows for easy expansion through specializations (customizations to the DTD). A schema is also available for DITA.
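As a rough illustration of what a “typed” topic looks like, the following Python sketch builds a small task topic with the standard library. The build_task helper and the sample content are hypothetical and exist only for this example. The element names used (task, title, taskbody, steps, step, cmd) follow the DITA task structure, but the result is not validated against the DITA DTD here.

```python
import xml.etree.ElementTree as ET
from typing import Iterable


def build_task(topic_id: str, title: str, commands: Iterable[str]) -> ET.Element:
    """Build a minimal DITA-style task topic: a title plus ordered steps."""
    task = ET.Element("task", {"id": topic_id})
    ET.SubElement(task, "title").text = title
    steps = ET.SubElement(ET.SubElement(task, "taskbody"), "steps")
    for command in commands:
        # Each step wraps a single cmd element holding the instruction text.
        step = ET.SubElement(steps, "step")
        ET.SubElement(step, "cmd").text = command
    return task


# Hypothetical sample content, invented for this illustration.
task = build_task(
    "replace-toner",
    "Replacing the toner cartridge",
    ["Open the front cover.", "Slide the old cartridge out."],
)
print(ET.tostring(task, encoding="unicode"))
```

Because the topic type is carried by the element names themselves, a concept or reference topic would use a different root element and body rather than different formatting.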

The Adobe FrameMaker DITA Development Group

The DITA DTD, schema, and related files including XSLT, are available in a ZIP file.

There is also an EDD (element definition document), along with related files, that works with Adobe FrameMaker.

Learn more about DITA here.

DocBook XML DTD

Available by download from many places, and installed with many tools as a “freebie”, the DocBook XML DTD gives users absolutely every element they might need to create software or hardware documentation. However, DocBook may not be the right choice for some authors, especially those unfamiliar with authoring structured XML content. Additionally, authors have complained that the number of elements contained in DocBook is overwhelming (over 300 valid elements are included) and that as a result, using the DTD is more difficult than it ought to be. For that reason, some organizations have adopted the Simplified DocBook DTD (detailed below).

Check out SourceForge.net and other sites for DocBook projects, downloadable DTDs, and sample files. You may also try a Google search for “Norm Walsh and DocBook” to find examples and historical data on this DTD, which has pre-XML roots in SGML (Standard Generalized Markup Language).

Learn more about DocBook here.

A downloadable structured Adobe FrameMaker DocBook XML example project created by Steve Whitlatch is available online as well. It includes an expanded discussion of using the structured version of Adobe FrameMaker and the DocBook XML DTD and details some new auto-formatting features for tables, graphics, and xrefs. Additional information about the DocBook Extensible Stylesheet Language (XSL) project is also provided.

Learn more about DocBook for Adobe FrameMaker here.

Simplified DocBook XML DTD

According to DocBook.org, the Simplified DocBook DTD “is a small subset of the DocBook XML DTD” (105 elements) created to help authors who may be overwhelmed by the sheer number of valid elements in the original DocBook XML DTD (over 300 elements).

Learn more about the Simplified DocBook XML DTD here.

A variety of DocBook resources are available here.

DocBook Slides DTD

The DocBook Slides DTD is also a free DTD contributed for public use by Norm Walsh. By using the DocBook Slides DTD (and the many stylesheets and support files that come with it), you can immediately begin creating XML slides and transforming them into online presentations. The main element is the “Foil”, a reference to the floppy plastic slides once used for presentations. Users may create a “FoilGroup” and place many “Foils” within their presentation. At the start of a presentation there are elements for creating title and speaker information as well. The DocBook Slides stylesheets transform each foil into an online page with the user’s choice of look-and-feel, a table of contents, and complete navigation (next/previous/contents).
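The sketch below shows the general shape of such a presentation and how next/previous navigation can be derived from the order of the foils. It is written in Python purely as an illustration; the actual project does this work with its own stylesheets. The sample content and the foil_navigation function are invented for this example, and the lowercase element names (slides, slidesinfo, foilgroup, foil) are assumed here rather than checked against the DTD.

```python
import xml.etree.ElementTree as ET

# Hypothetical presentation, invented for this illustration.
SLIDES_XML = """
<slides>
  <slidesinfo><title>Free DTDs you can use today</title></slidesinfo>
  <foilgroup>
    <title>DocBook family</title>
    <foil><title>DocBook</title><para>Over 300 elements.</para></foil>
    <foil><title>Simplified DocBook</title><para>A small subset.</para></foil>
    <foil><title>DocBook Slides</title><para>Foils and foil groups.</para></foil>
  </foilgroup>
</slides>
"""


def foil_navigation(xml_text):
    """Return one dict per foil with its title and previous/next foil titles."""
    root = ET.fromstring(xml_text.strip())
    # Document order of the foil elements defines the order of the pages.
    titles = [foil.findtext("title") for foil in root.iter("foil")]
    pages = []
    for index, title in enumerate(titles):
        pages.append({
            "title": title,
            "previous": titles[index - 1] if index > 0 else None,
            "next": titles[index + 1] if index < len(titles) - 1 else None,
        })
    return pages


for page in foil_navigation(SLIDES_XML):
    print(page["previous"], "<-", page["title"], "->", page["next"])
```

The first foil has no previous page and the last has no next page, so both are reported as None.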

Visit SourceForge.net to download the DocBook Slides project files and corresponding DTD.

Learn more about DocBook Slides here.

The DocBook project

A DocBook Slides demonstration is available here.

U.S. Military DTDs

Recently, the Department of the Navy (DON) released its Naming & Design Rules (NDR) version 2.0 for creating and working with XML structures. Version 2.0 firmly establishes a common structure and language for authoring and presenting content, and enables the discovery and reuse of common content elements across the naval enterprise.

Learn more about the U.S. Navy’s XML NDR here.

Learn more about the U.S. Navy’s information management plan here.

The U.S. Department of Defense (DOD) maintains an XML Gallery—a metadata registry and clearinghouse for military XML information. The purpose of the Gallery’s XML Registry is to promote the consistent use of XML, both vertically within individual agencies and horizontally across all DOD agencies.

Learn more about the U.S. DOD XML Gallery.

Conclusion

XML DTDs allow you to add levels of control to the documents you create. DTDs facilitate structured content development, content reuse, consistency, and interoperability. Whether you create your own XML DTDs or use one or more of the many freely available XML DTDs is up to you. No matter which approach you take, make sure you learn everything you can about your business needs (those of the end users, as well as the needs of your content creators) before you decide which route to take. Additional resources are provided below for those who’d like to learn more about XML DTDs.

Bibliography

Document Type Definition Tutorial—a free online class from w3schools

Fundamental Concepts of Reuse—understanding content reuse (Ann Rockley, Pamela Kostur and Steve Manning)

XML Schema Tutorial—a free online class from w3schools

XML Weekend Crash Course

Author information

Biography

Kay Ethier is an Adobe Certified Expert in FrameMaker 7.x and several prior versions. She instructs training classes, performs consulting, and provides support to clients in a variety of industries. Kay resides in the Research Triangle Park area of North Carolina and works for Bright Path Solutions. In 2001, Kay co-authored the book "XML Weekend Crash Course" (Wiley/HungryMinds). She has most recently been a contributing author on "Advanced FrameMaker" (TIPS Technical Publishing) and "XML and FrameMaker" (Apress).
