User Guide for Elmo

Peter Mika

Initial version

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in GNU Free Documentation License.


Table of Contents

1. Introduction
1.1. What is Elmo?
2. Elmo object model
2.1. Hello, world!
2.2. Using a persistent Sesame store as a back-end
2.3. RSS example
2.4. Dynamic access to ontologies
2.5. More about the ElmoSession
2.6. Extending the object model
3. Elmo Tools
3.1. The Elmo Scutter
3.2. The Elmo Smusher
3.3. The Elmo Validator
A. Resources
A.1.
A.2. Questions, comments
A.3. Acknowledgement
B. GNU Free Documentation License
B.1. PREAMBLE
B.2. APPLICABILITY AND DEFINITIONS
B.3. VERBATIM COPYING
B.4. COPYING IN QUANTITY
B.5. MODIFICATIONS
B.6. COMBINING DOCUMENTS
B.7. COLLECTIONS OF DOCUMENTS
B.8. AGGREGATION WITH INDEPENDENT WORKS
B.9. TRANSLATION
B.10. TERMINATION
B.11. FUTURE REVISIONS OF THIS LICENSE
B.12. How to use this License for your documents

Abstract

This document discusses both the object model of Elmo and the tools in the distribution. The documentation of the library is intended for both users and developers (i.e. those who would like to extend the object model or the tools). We recommend reading this documentation in parallel to the JavaDoc as the two are mostly complementary.

Reading this document assumes a basic knowledge of RDF concepts, a familiarity with the openrdf model classes and concepts related to Sesame repositories or a willingness to learn about them as we go along.

Namespace abbreviations are used in this document for readability. For example, the term foaf:Person represents the RDF resource denoted with the URI http://xmlns.com/foaf/0.1/Person.

1. Introduction

1.1. What is Elmo?

In short, Elmo is for anyone planning to develop Semantic Web applications using JavaBeans and Sesame.

Elmo is Java library for Semantic Web applications. Elmo allows developers to create applications that work with RDF/OWL knowledge bases at the level of ontologies, using JavaBeans, instead of working on the level of RDF/OWL languages, using generic Resource Objects.

Elmo provides support for developing applications using the most popular Web ontologies, including FOAF, RSS 1.0 and Dublin Core. Elmo contains a static object model for these ontologies as well as some tools to work with them, for example an RDF crawler and a smusher for FOAF data. Elmo gives dynamic access to data in other ontologies and is also easily extendible with new static models.

Elmo uses the popular Sesame storage and query facility as a backend, representing a good choice for working with the lightweight ontologies that are most commonly used in web applications, both in terms of features and scalability.

2. Elmo object model

Elmo is a simple Java library, which means that installing it simply requires including the jar file in the classpath of the compiler and importing the appropriate classes in the code. If you want a persistant storage of your objects, however, you will also need to install and setup Sesame as a server or library.

2.1. Hello, world!

The Elmo object model builds on a simple concept: each ontological class has a matching Java class in the library, using the same name. Classes from a single ontology are collected in subpackages of the org.openrdf.elmo.model package. For example, classes of the FOAF model can be found in the Java package org.openrdf.elmo.model.foaf.

Properties of an instance can be set or read by calling getter/setter methods on the Java object. Inheritance is used where there is a subclass relationship between two ontological classes. Unlike RDF(S), Java does not support multiple inheritance, so each Java class has only one superclass.

Let's start with the simplest possible example, where we create a foaf:Person object, set some of its properties and lastly write it out as RDF. This example and the others to follow can be found in the Elmo source distribution in the package org.openrdf.elmo.example.

ElmoSession session = new ElmoSession(); 
URI jackURI = session.createURI("http://www.jackandjill.example.org/#jack"); 
         
Person jack = (Person) session.getInstance(jackURI, Person.class); 
 
jack.setName(session.createLiteral("Jack"));
jack.setMbox(session.createURI("mailto:jack@jackandjill.example.org"));
jack.setMbox(session.createURI("mailto:jack@work.example.org")); 
   
RdfDocumentWriter writer = new AbbreviatedRdfXmlWriter(System.out); 
writer.setNamespace("foaf", Person.FOAF_NS); 
writer.startDocument(); 
new PersonWriter().writeRDF(writer, jack); 
writer.endDocument(); 

The output looks like this:

<?xml version="1.0" encoding="UTF-8"?> 
<rdf:RDF 
 xmlns:foaf="http://xmlns.com/foaf/0.1/" 
 xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" 
 xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"> 
<foaf:Person rdf:about="http://www.jackandjill.example.org/#jack"> 
 <foaf:name>Jack</foaf:name> 
 <foaf:mbox rdf:resource="mailto:jack@work.example.org"/> 
 <foaf:mbox rdf:resource="mailto:jack@jackandjill.example.org"/> 
</foaf:Person> 
</rdf:RDF>

In this example, we first create an in-memory ElmoSession by calling the default constructor. An ElmoSession provides methods to acquire Elmo objects and manages their interaction with the underlying repository. Read more about the ElmoSession.

Next, we create the URI that will identify the new Person object. ElmoSession also provides factory methods for creating Sesame BNode (blank nodes) and Literal. (In the Sesame object model, BNode and URI are subinterfaces of the more generic Resource interface and they are implemented as BNodeImpl and URIImpl. For more details, see the openrdf model documentation.) Using the factory methods of ElmoSession is the preferred way to create these objects.

Next, we create an instance of the Person class by calling another session method, getInstance. This version of the method takes the id of the resource and the Java class to be returned as identifiers. (Note: Elmo objects should not be created using their public constructors.) The Sesame Resource object that was used as identifier can be later retrieved using the getResource method.

Once we have a reference to a Person object, we can set some properties. In this case, we state the name of the Person and two of his email addresses. Setter methods either take instances of Literal or Resource as arguments or other model objects. (This latter case will be demonstrated later on.)

Lastly, we write out an RDF/XML serialization of this Person by creating an instance of RdfDocumentWriter and of PersonWriter. RdfDocumentWriter is a RIO interface and it is implemented by a number of serializers capable of producing various RDF syntaxes including RDF/XML, N3, N-Triples and Turtle. For more information, see the RIO documentation. PersonWriter is an Elmo Writer that can serialize Person objects in various ways.

Note that for nicer formatting, we set the namespace abbreviation for FOAF. Namespaces and URI objects for properties can be accessed as static members of the model classes.

2.2. Using a persistent Sesame store as a back-end

The objects we have created so far were in memory representations. However, it is also possible to use a persistant Sesame repository implementation as backend.(Sesame has a variety of repository implementations using databases, the file system, memory etc. ) This repository will then be used to retrieve the values of properties as well as store the values set.

The ElmoRepository class has a number of static methods to access Sesame repositories in different ways. ElmoRepository is wrapper for the SesameRepository class of the Sesame API and exposes all methods of the underlying SesameRepository. For additional information, please see the documentation for SesameRepository.

The easiest way is to access a remote repository; this can be done by providing the URL of the server and the name of the repository. The obtained ElmoRepository is then used when instantiating the ElmoSession:

//Repository-based ElmoSession 
ElmoRepository repository = ElmoRepository.getRepository(new URL("http://localhost:8080/sesame"), "test"); 
ElmoSession session = new ElmoSession(repository); 

While it's possible to instantiate model objects one-by-one using the getInstance method shown above, it is more common to retrieve a set of instances matching a query. This is done by the getInstances method, which in its most simple form requires only the query and the class to be returned:

String query = "SELECT person FROM {person} <" + Person.FOAF_NAME + "> {name} WHERE name=\"Jack\""; 
Collection persons = session.getInstances(query, Person.class); 
Iterator it = persons.iterator();

A limitation is that the query must return a single column and cannot use a '*' in the select clause. It is of course also possible execute queries directly on the ElmoRepository and instantiate resources one-by-one as iterating over the result.

The Person class (and some other classes) also have convenience methods to retrieve all Person instances in a repository or to retrieve instances matching a certain description. See the JavaDoc for more detail.

Important to note about repository based ElmoSessions is that all settter methods write directly to the repository by default. This automatic commit feature can be disabled by using the setAutoCommit method. Once autocommit is disabled, changes can be committed by calling the commit method and they can be discarded with rollback. Get methods do not return set values unless they are committed.

Note: The Elmo API requires a Sesame server installation with a version number of 1.2 or higher.

2.3. RSS example

Consider also the following example using the RSS model. In this example, we create an RSS channel and put two items on it. Lastly, as before, we write out the contents of the channel as RDF.

ElmoSession session = new ElmoSession(); 
   
Resource channelURI = session.createURI("http://rss.example.org"); 
 
Channel channel = (Channel) session.getInstance(channelURI, Channel.class); 
   
channel.setTitle(session.createLiteral("Test channel")); 
channel.setLink(session.createLiteral("http://www.cs.vu.nl/")); 
channel.setDescription(session 
    .createLiteral("This is the description of this channel.")); 
   
DCResource dcChannel = (DCResource) session.getInstance(channelURI, DCResource.class); 
dcChannel.setSubject(session.createLiteral("Semantic Web")); 
   
   
Item item = (Item) session.getInstance(session 
    .createURI("http://rss.example.org#1"), Item.class); 
item.setTitle(session.createLiteral("Our latest news item")); 
item.setDescription(session 
    .createLiteral("This is the description of the latest news item")); 
channel.addItem(item); 
 
item = (Item) session.getInstance(session 
    .createURI("http://rss.example.org#2"), Item.class); 
item.setTitle(session.createLiteral("An older news item")); 
item.setDescription(session 
    .createLiteral("This is the description of the older news item")); 
channel.addItem(item); 

RdfDocumentWriter writer = new AbbreviatedRdfXmlWriter(System.out); 
writer.setNamespace("rss", Channel.RSS_NS); 
writer.setNamespace("dc", DCResource.DC_NS); 

writer.startDocument(); 
new ChannelWriter().writeRDF(writer, channel); 
writer.endDocument(); 
 

You might note that the Channel class has a convenience method for adding an item to the channel (addItem), which also takes care of creating a Seq instance (if this is the first item) and linking it to the channel. There is also a convenience method for accessing the items that have been added (getItemsAsList).

We can also see in this example how multiple typing is used Elmo. Namely, we create an instance of the Dublin Core ontology with the same URI as the channel and then set a value for the dc:subject property. We don't have to serialize this resource separately because the writeRDF method of the Channel class takes care of that. (RSS 1.0 is often used in combination with Dublin Core.)

On a minor note: we also have to use the AbbreviatedRdfXmlWriter to serialize RSS channels, because the RSS 1.0 specification requires this format. In particular, RSS 1.0 documents should also validate against the DTD of XML-based versions of RSS, because many existing tools processs RSS 1.0 using XML parsers. There are other, soft rules of RSS validation (such as the recommended number of items per channel or the allowed length of descriptions) and for this reason we recommend checking the validity of the produced RDF with one (or preferable more) of the many RSS validators available on the Web.

2.4. Dynamic access to ontologies

Elmo provides JavaBeans to access popular ontologies and is easily expanded to support more, see Extending the object model for more information. Elmo also provides a dynamic interface to access resources, using Apache's beanutils' DynaBean interface.

DynaResource has a one getter and one setter method, #get(String) and #set(String,Object) respectively. The property name (String) is the local name of the property. If there are multiple properties with the same local name the behaviour is not guaranteed and DynaResource should not be used to access that property. The return Object of the getter and the Object of the setter are based on the ontology of the class types of the resource. A Resource wrapped by a DynaResource will return/expect one of three values: Set, List, or DynaResource as described in the following table.

owl:maxCardinality of 1multiple cardinality (default)
rdf:range or owl:allValuesFrom of rdf:Listjava.util.List of DynaResourcejava.util.Set of DynaResource
not a List (default)DynaResourcejava.util.Set of DynaResource

If the property is a Literal, a DynaResource is still returned. Its value can be accessed by using the #as(Class) method. Otherwise the DynaResource is read-only and provides five properties: label, language, datatype, type, and locale; they return String, String, model.URI, Collection of rdfs.Class and Locale, respectively.

The expected property type can be retrieved at runtime with the method:

DynaResource#getDynaClass().getDynaProperty(String).getType()

See the DynaResourceTest class for more examples.

2.5. More about the ElmoSession

The primary function of the ElmoSession is to provide performance improvements by (a memory-sensitive) caching and by expanding queries. The limits of query expansion are application dependent: while executing a more general query than necessary often improves performance by speeding up future access, it can also slow down the application temporarily or result in overly large result sets that break memory limits. For this reason query expansion can be regulated.

The getInstances method mentioned above can also take a third boolean parameter that specifies whether all properties of all instances should be preloaded. Another form of query expansion occurs when a property is read on an instance that was the result of a query. In that case, that property is read for all instances that were also the result of that query. This kind of query expansion can be enabled or disabled by calling the setExpandQuery method of the ElmoSession.

ElmoSession also implements the String BeanFactory. Providing #getBean(String,Class) as an alternative to #getInstance(URI,Class). Also included is the class ElmoSessionFactory, which implements String's FactoryBean. This class provides abstract ElmoSession creation. It can be used as follows:

ElmoSessionFactory factory = new ElmoSessionFactory();
factory.setConfigFile("config.xml");
factory.setRepositoryId("main");
factory.init();
ElmoSession session = factory.getSession();
      

Note: All ElmoSessions created with the same factory will share the same schema cache, which reduces multiple session overhead.

2.6. Extending the object model

The object model of Elmo is easily extensible with additional classes to support your own ontologies.

Again, the best way to understand how things work is to consider an example. Below is the full source of the foaf:Group class:

package org.openrdf.elmo.model.foaf; 
 
public class Group extends FOAFResource { 
 
    final static URI FOAF_GROUP = new URIImpl(FOAF_NS + "Group"); 
 
    public static final URI ABOUT = FOAF_GROUP; 
     
    /* 
     * Properties with a domain of foaf:Group 
     */ 
 
    final static URI FOAF_MEMBER = new URIImpl(FOAF_NS + "member"); 
 
    public Group() { 
        super(); 
    } 
 
    public Group(String uri) { 
        super(uri); 
    } 
 
 
    //MEMBER 
    public Set getMember() throws QueryEvaluationException { 
        return _session.getInstances(_id, FOAF_MEMBER, Agent.class); 
    } 
 
    public void setMember(Agent value) { 
        _session.addStatement(_id, FOAF_MEMBER, value.getResource()); 
    } 
    
}

Model classes are all subclasses of Elmo's Resource class from the rdfs subpackage (not to be confused with Sesame Resource interface). This class represents the RDF(S) notion of a Resource with all the properties defined in RDF(S) such as label, comment and seeAlso.

In the case of Group, there is another class between Resource and Group in the inheritance hierarchy: FOAFResource. This class doesn't have an ontological equivalent but it is a placeholder for all the properties that the FOAF ontology defines for all resources (these are name, homepage and depiction at the time of writing).

Subclasses of Resource should implement two public constructors, one default, and one with a single String parameter, that call the constructor of the superclass. These constructors are used to instantiate blank nodes.

Next, a get and a set methods are defined for the only property of foaf:Group, which is foaf:member. The getter method should call the getInstances method of ElmoSession that takes the id of the resource, the property (a URI) and the class of values to be returned. In this case, the range of the property is another model class, foaf:Agent. The setter method calls the addStatement method of ElmoSession with the id, the property (the same URI) and the Sesame Value to be set. In the case of an Elmo resource (as here) the Sesame Resource to be added can be retrieved by calling getResource.

3. Elmo Tools

Elmo comes with a number of utility classes for processing FOAF data. All of these tools are provided with a servlet interface and can be readily deployed in a servlet container such as Tomcat by placing the elmo.war package in the web application directory. You will need to modify the provided web.xml file to specify the details of your Sesame installation. After restarting Tomcat, the tools can be accessed by pointing the browser to the root of the installation (http://www.localhost:8080/elmo).

3.1. The Elmo Scutter

The Elmo scutter is a generic RDF crawler that follows rdfs:seeAlso links in RDF documents, which typically point to other relevant RDF sources on the web. The Elmo scutter is based on original code by Matt Biddulph for Jena.

RDF(S) seeAlso is also the mechanism used to connect FOAF profiles and thus (given a starting location) the scutter allows to collect FOAF profiles from the Web. Several advanced features are provided to support this scenario:

  • Blacklisting: sites that produce FOAF profiles in large quantities are automatically placed on a blacklist. This is to avoid collecting large amounts of uninteresting FOAF data produced by social networking and blogging services or other dynamic sources.

  • Whitelisting: the crawler can be limited to a domain (defined by a URL pattern).

  • Metadata: the crawler can optionally store metadata about the collected statements. This metadata currently includes provenance (what URL was the information coming from) and timestamp (time of collection)

  • Filtering: incoming statements can be filtered individually. This is useful to remove unnecessary information, such as statements from unknown namespaces.

  • Persistence: when the scutter is stopped, it saves its state to the disk. This allows to continue scuttering from the point where it left off. Also, when starting the scutter it tries to load back the list of visited URLs from the repository (this requires the saving of metadata to be turned on).

  • Preload from Google: the scutter queue can be preloaded by searching for FOAF files using Google

  • Logging: The Scutter uses log4j to provide a detailed logging of the crawler.

The data collected by the scutter is stored in a Sesame repository (see configuration below). We recommend using a Native RDF repository for scuttering, because it provides the best performance for uploads.

The Scutter is available as a Java class as well as a Java servlet. The servlet provides access to all of the above features, except for filtering (which requires programming) and it can be deployed by placing the Elmo.war file in the web application directory of a Servlet/JSP container.

The servlet initialization parameters to be specified in the web.xml descriptor file are listed below. An example web.xml file is provided in the war file.

Table 2. Scutter servlet initialization parameters

Parameter nameDescriptionRequired/Optional/Default
serverURL of the Sesame server to store the collected dataRequired
repositoryName of the repository on the serverRequired
usernameUsername for access to the Sesame repositoryOptional
passwordPassword for access to the Sesame repositoryOptional
queueLocation of the file used to save the queue when the scutter is stoppedRequired
preloadSizeNumber of seed URLs to extract from GoogleOptional, default = 500
startURL used to start scutteringOptional
domainLimits crawling to URLs that match the provided regular expression.Optional
metadataProduce reified statements containing information about the provenance of the statements and the time they were collected. Possible values: true/falseOptional, defaults to false.

The request parameters to the server are listed in the table below. For convenience, there is an html file provided in the distribution for calling various operations on the servlet.

Table 3. Scutter servlet request parameters

Parameter nameDescriptionRequired/Optional/Default
startTry to load the set of visited URLs and start the scutterParameter value ignored.
stopStop the scutter, save the queue to diskParameter value ignored.
searchPreload the queue by searching for FOAF profiles using a search engineParameter value ignored.
preloadQueuePreload the queue from the saved fileParameter value ignored.
clearClear the queue and the set of visited URLsParameter value ignored.

A custom filtering of statements can be implemented by setting an instance of the StatementFilter interface using the setStatementFilter method of the Scutter class. See the JavaDoc for more details.

3.2. The Elmo Smusher

The task of the Elmo smusher is to find equivalent instances in large sets of data. This is a very common problem when processing collections of FOAF profiles as several sources on the Web may describe a the same individual using different identifiers or blank nodes (which are always assumed to be different). While the servlet provided is specific to smushing foaf:Person instances, the underlying mechanism is generic

The smusher uses instances of ResourceComparator for comparing instances. Implementations of ResourceComparator are given for foaf:Person and swrc:Publication.

The smusher reports the results (matching instances) by calling methods on registered listeners. Listeners implement the SmusherListener interface. Two implementations of SmusherListener are provided: one writes out results in text, while the other represents matches using the owl:sameAs relationship and uploads such statements to a Sesame repository. While Sesame does not directly support OWL semantics, the semantics of this relationship (the equivalence of property values) can be easily axiomatized using Sesame's custom rule language.

The servlet can be deployed by placing the elmo.war file in the web application directory of a Servlet/JSP container.

The servlet initialization parameters to be specified in the web.xml descriptor file are listed below. An example web.xml file is provided in the war file.

Table 4. Smusher servlet initialization parameters

Parameter nameDescriptionRequired/Optional/Default
serverURL of the Sesame server to store the collected dataRequired
repositoryName of the repository on the serverRequired
usernameUsername for access to the Sesame repositoryOptional
passwordPassword for access to the Sesame repositoryOptional

The smusher servlet has no request parameter. As the service is invoked it starts smushing on the repository defined in the web application descriptor (web.xml).

Note that the smusher is currently not iterative, i.e. smushing is performed only once. You may want to repeat smushing in case new matches can be found on the basis of what has been inferred in a previous round.

3.3. The Elmo Validator

The task of the Elmo validator is to validate instances using Java. Programmatic validation is often necessary, since not every requirement in ontologies can be represented in RDF or OWL. For example, validation can be used to check common mistakes in FOAF profiles such as providing a Literal as the value of the mbox property.

Again, the validation mechanism is more general and available from Java by calling the validate method on classes that support validation, such as the Person class in the foaf subpackage. Validation is implemented using a listener pattern: various levels of validation errors are reported by calling methods on the listener passed in to the validate method. Listeners implement the AdminListener interface of Sesame. Standard implementations of this interface are provided with Sesame, e.g. for writing out errors as plain text, HTML or XML.

There is also servlet interface to validation (ValidatorServlet) which validates all foaf:Person instances found in a repository. This is particulary useful when checking the results of scuttering.

The ValidatorServlet takes the same initialization parameters (specified in the web.xml file) as the Smusher. There are no request parameters to specify.

A. Resources

RDF

the openrdf model classes and concepts related to Sesame repositories

See the JUnit test provided with the source for more code examples.

A.2. Questions, comments

If you have question regarding Elmo, please use the forums on openrdf.org. Feature requests and bugs can be reported using the Issue Tracker on the same website.

Contact the author of this document ( Peter Mika) with comments and suggestion regarding the documentation itself.

A.3. Acknowledgement

Most of the Elmo code has originally appeared in the Semantic Web application Flink, which still serves as a testbed for the library. Making this work public, however, would not have been possible without the help and support of Arjohn Kampman and Jeen Broekstra from Aduna. James Leigh has contributed fundamental improvements and key new features to Elmo from version 0.2 and continues to be the leading co-developer on Elmo.

B. GNU Free Documentation License

Version 1.1, March 2000

Copyright (C) 2000 Free Software Foundation, Inc. 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed.

B.1. PREAMBLE

The purpose of this License is to make a manual, textbook, or other written document "free" in the sense of freedom: to assure everyone the effective freedom to copy and redistribute it, with or without modifying it, either commercially or noncommercially. Secondarily, this License preserves for the author and publisher a way to get credit for their work, while not being considered responsible for modifications made by others.

This License is a kind of "copyleft", which means that derivative works of the document must themselves be free in the same sense. It complements the GNU General Public License, which is a copyleft license designed for free software.

We have designed this License in order to use it for manuals for free software, because free software needs free documentation: a free program should come with manuals providing the same freedoms that the software does. But this License is not limited to software manuals; it can be used for any textual work, regardless of subject matter or whether it is published as a printed book. We recommend this License principally for works whose purpose is instruction or reference.

B.2. APPLICABILITY AND DEFINITIONS

This License applies to any manual or other work that contains a notice placed by the copyright holder saying it can be distributed under the terms of this License. The "Document", below, refers to any such manual or work. Any member of the public is a licensee, and is addressed as "you".

A "Modified Version" of the Document means any work containing the Document or a portion of it, either copied verbatim, or with modifications and/or translated into another language.

A "Secondary Section" is a named appendix or a front-matter section of the Document that deals exclusively with the relationship of the publishers or authors of the Document to the Document's overall subject (or to related matters) and contains nothing that could fall directly within that overall subject. (For example, if the Document is in part a textbook of mathematics, a Secondary Section may not explain any mathematics.) The relationship could be a matter of historical connection with the subject or with related matters, or of legal, commercial, philosophical, ethical or political position regarding them.

The "Invariant Sections" are certain Secondary Sections whose titles are designated, as being those of Invariant Sections, in the notice that says that the Document is released under this License.

The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts or Back-Cover Texts, in the notice that says that the Document is released under this License.

A "Transparent" copy of the Document means a machine-readable copy, represented in a format whose specification is available to the general public, whose contents can be viewed and edited directly and straightforwardly with generic text editors or (for images composed of pixels) generic paint programs or (for drawings) some widely available drawing editor, and that is suitable for input to text formatters or for automatic translation to a variety of formats suitable for input to text formatters. A copy made in an otherwise Transparent file format whose markup has been designed to thwart or discourage subsequent modification by readers is not Transparent. A copy that is not "Transparent" is called "Opaque".

Examples of suitable formats for Transparent copies include plain ASCII without markup, Texinfo input format, LaTeX input format, SGML or XML using a publicly available DTD, and standard-conforming simple HTML designed for human modification. Opaque formats include PostScript, PDF, proprietary formats that can be read and edited only by proprietary word processors, SGML or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML produced by some word processors for output purposes only.

The "Title Page" means, for a printed book, the title page itself, plus such following pages as are needed to hold, legibly, the material this License requires to appear in the title page. For works in formats which do not have any title page as such, "Title Page" means the text near the most prominent appearance of the work's title, preceding the beginning of the body of the text.

B.3. VERBATIM COPYING

You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to those of this License. You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute. However, you may accept compensation in exchange for copies. If you distribute a large enough number of copies you must also follow the conditions in section 3.

You may also lend copies, under the same conditions stated above, and you may publicly display copies.

B.4. COPYING IN QUANTITY

If you publish printed copies of the Document numbering more than 100, and the Document's license notice requires Cover Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on the back cover. Both covers must also clearly and legibly identify you as the publisher of these copies. The front cover must present the full title with all words of the title equally prominent and visible. You may add other material on the covers in addition. Copying with changes limited to the covers, as long as they preserve the title of the Document and satisfy these conditions, can be treated as verbatim copying in other respects.

If the required texts for either cover are too voluminous to fit legibly, you should put the first ones listed (as many as fit reasonably) on the actual cover, and continue the rest onto adjacent pages.

If you publish or distribute Opaque copies of the Document numbering more than 100, you must either include a machine-readable Transparent copy along with each Opaque copy, or state in or with each Opaque copy a publicly-accessible computer-network location containing a complete Transparent copy of the Document, free of added material, which the general network-using public has access to download anonymously at no charge using public-standard network protocols. If you use the latter option, you must take reasonably prudent steps, when you begin distribution of Opaque copies in quantity, to ensure that this Transparent copy will remain thus accessible at the stated location until at least one year after the last time you distribute an Opaque copy (directly or through your agents or retailers) of that edition to the public.

It is requested, but not required, that you contact the authors of the Document well before redistributing any large number of copies, to give them a chance to provide you with an updated version of the Document.

B.5. MODIFICATIONS

You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version:

  1. Use in the Title Page (and on the covers, if any) a title distinct from that of the Document, and from those of previous versions (which should, if there were any, be listed in the History section of the Document). You may use the same title as a previous version if the original publisher of that version gives permission.

  2. List on the Title Page, as authors, one or more persons or entities responsible for authorship of the modifications in the Modified Version, together with at least five of the principal authors of the Document (all of its principal authors, if it has less than five).

  3. State on the Title page the name of the publisher of the Modified Version, as the publisher.

  4. Preserve all the copyright notices of the Document.

  5. Add an appropriate copyright notice for your modifications adjacent to the other copyright notices.

  6. Include, immediately after the copyright notices, a license notice giving the public permission to use the Modified Version under the terms of this License, in the form shown in the Addendum below.

  7. Preserve in that license notice the full lists of Invariant Sections and required Cover Texts given in the Document's license notice.

  8. Include an unaltered copy of this License.

  9. Preserve the section entitled "History", and its title, and add to it an item stating at least the title, year, new authors, and publisher of the Modified Version as given on the Title Page. If there is no section entitled "History" in the Document, create one stating the title, year, authors, and publisher of the Document as given on its Title Page, then add an item describing the Modified Version as stated in the previous sentence.

  10. Preserve the network location, if any, given in the Document for public access to a Transparent copy of the Document, and likewise the network locations given in the Document for previous versions it was based on. These may be placed in the "History" section. You may omit a network location for a work that was published at least four years before the Document itself, or if the original publisher of the version it refers to gives permission.

  11. In any section entitled "Acknowledgements" or "Dedications", preserve the section's title, and preserve in the section all the substance and tone of each of the contributor acknowledgements and/or dedications given therein.

  12. Preserve all the Invariant Sections of the Document, unaltered in their text and in their titles. Section numbers or the equivalent are not considered part of the section titles.

  13. Delete any section entitled "Endorsements". Such a section may not be included in the Modified Version.

  14. Do not retitle any existing section as "Endorsements" or to conflict in title with any Invariant Section.

If the Modified Version includes new front-matter sections or appendices that qualify as Secondary Sections and contain no material copied from the Document, you may at your option designate some or all of these sections as invariant. To do this, add their titles to the list of Invariant Sections in the Modified Version's license notice. These titles must be distinct from any other section titles.

You may add a section entitled "Endorsements", provided it contains nothing but endorsements of your Modified Version by various parties--for example, statements of peer review or that the text has been approved by an organization as the authoritative definition of a standard.

You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover Texts in the Modified Version. Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or through arrangements made by) any one entity. If the Document already includes a cover text for the same cover, previously added by you or by arrangement made by the same entity you are acting on behalf of, you may not add another; but you may replace the old one, on explicit permission from the previous publisher that added the old one.

The author(s) and publisher(s) of the Document do not by this License give permission to use their names for publicity for or to assert or imply endorsement of any Modified Version.

B.6. COMBINING DOCUMENTS

You may combine the Document with other documents released under this License, under the terms defined in section 4 above for modified versions, provided that you include in the combination all of the Invariant Sections of all of the original documents, unmodified, and list them all as Invariant Sections of your combined work in its license notice.

The combined work need only contain one copy of this License, and multiple identical Invariant Sections may be replaced with a single copy. If there are multiple Invariant Sections with the same name but different contents, make the title of each such section unique by adding at the end of it, in parentheses, the name of the original author or publisher of that section if known, or else a unique number. Make the same adjustment to the section titles in the list of Invariant Sections in the license notice of the combined work.

In the combination, you must combine any sections entitled "History" in the various original documents, forming one section entitled "History"; likewise combine any sections entitled "Acknowledgements", and any sections entitled "Dedications". You must delete all sections entitled "Endorsements."

B.7. COLLECTIONS OF DOCUMENTS

You may make a collection consisting of the Document and other documents released under this License, and replace the individual copies of this License in the various documents with a single copy that is included in the collection, provided that you follow the rules of this License for verbatim copying of each of the documents in all other respects.

You may extract a single document from such a collection, and distribute it individually under this License, provided you insert a copy of this License into the extracted document, and follow this License in all other respects regarding verbatim copying of that document.

B.8. AGGREGATION WITH INDEPENDENT WORKS

A compilation of the Document or its derivatives with other separate and independent documents or works, in or on a volume of a storage or distribution medium, does not as a whole count as a Modified Version of the Document, provided no compilation copyright is claimed for the compilation. Such a compilation is called an "aggregate", and this License does not apply to the other self-contained works thus compiled with the Document, on account of their being thus compiled, if they are not themselves derivative works of the Document.

If the Cover Text requirement of section 3 is applicable to these copies of the Document, then if the Document is less than one quarter of the entire aggregate, the Document's Cover Texts may be placed on covers that surround only the Document within the aggregate. Otherwise they must appear on covers around the whole aggregate.

B.9. TRANSLATION

Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in addition to the original versions of these Invariant Sections. You may include a translation of this License provided that you also include the original English version of this License. In case of a disagreement between the translation and the original English version of this License, the original English version will prevail.

B.10. TERMINATION

You may not copy, modify, sublicense, or distribute the Document except as expressly provided for under this License. Any other attempt to copy, modify, sublicense or distribute the Document is void, and will automatically terminate your rights under this License. However, parties who have received copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance.

B.11. FUTURE REVISIONS OF THIS LICENSE

The Free Software Foundation may publish new, revised versions of the GNU Free Documentation License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. See http://www.gnu.org/copyleft/.

Each version of the License is given a distinguishing version number. If the Document specifies that a particular numbered version of this License "or any later version" applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any version ever published (not as a draft) by the Free Software Foundation.

B.12. How to use this License for your documents

To use this License in a document you have written, include a copy of the License in the document and put the following copyright and license notices just after the title page:

Copyright (c) YEAR YOUR NAME. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with the Invariant Sections being LIST THEIR TITLES, with the Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. A copy of the license is included in the section entitled "GNU Free Documentation License".

If you have no Invariant Sections, write "with no Invariant Sections" instead of saying which ones are invariant. If you have no Front-Cover Texts, write "no Front-Cover Texts" instead of "Front-Cover Texts being LIST"; likewise for Back-Cover Texts.

If your document contains nontrivial examples of program code, we recommend releasing these examples in parallel under your choice of free software license, such as the GNU General Public License, to permit their use in free software.