What is XML? is the first in a series of BNC 101 blog posts where we’re going to try our best to break down some of the complex tech concepts we talk about all the time into plain language. Wish us luck!
Have an idea for a BNC 101 blog post? Leave a comment below to let us know.
XML is a term that gets thrown around the publishing industry a lot, but what does it actually mean?
First, XML stands for Extensible Markup Language. XML doesn’t do anything; instead, it lets you describe what something is. It is a text format that lets you define information for computer-to-computer communication. Basically, it’s a way to let two programs that speak different languages talk to each other.
I find it’s best to think of XML as content without form: XML is what is in the background describing what everything is, then how it looks is determined by where that information is being used. Some familiar examples of XML-based languages include: XHTML for the web, IDML for InDesign, and ONIX for book information.
Elements
Elements are the building blocks of XML. Think of these elements like descriptors, adjectives attributed to the content. Each bit of content gets described by the element. Elements are made up of opening and closing tags, and the content goes between these tags.
Elements look like this:
<tag>content</tag>
Or, for something publishing-specific, like this:
<Title>Canadian Book Market</Title>
XML allows you to describe infinite amounts of information, but it is the receiving program that decides what to do with it. If an online store receives your file, it will take that title tag and know to post the title as the title on its website. Some programs act on more of the described information than others, so to be on the safe side it’s best to provide more rather than less information to avoid blanks.
For example, if a book has a Canadian author, you would want to add an element that says the author is Canadian. Even if some receivers of the XML file won’t process it some will and it will be to your advantage.
XML always sounds big and scary, but really it’s just another version of something we’ve been doing for years in publishing: marking up documents in the same way you would a manuscript.
The publishing-specific XML language for transferring information about your books is called ONIX. Next week, we’ll post BNC 101: What is ONIX?—your introduction to online information exchange.
You can find all of our introductory blog posts in the BNC 101 category.