<?xml version="1.0" encoding="UTF-8" ?>




<!--  RSS generated by JIRA 108 at Wed May 22 02:28:44 CEST 2013 -->
<rss version="0.92">





<channel>
    <title>openRDF.org Issue Tracker</title>
    <link>http://www.openrdf.org/issues</link>
    <description>This file is an XML representation of some issues</description>
    <language>en</language>

    
<item>

    







<title>[RIO-29] blank node handling in parsers doesn&apos;t scale</title>
<link>http://www.openrdf.org/issues/browse/RIO-29</link>

    
        <description>The processing of blank nodes in Rio&apos;s parsers doesn&apos;t scale for documents with huge amount of named blank nodes in it. The reason for this is that the parsers keep track of all encountered named blank nodes as they may be referenced again later in the document. As blank node names scope to the entire document, the same blank node name must map to the same blank node objects. Possibly the best way to solve this is to do the mapping in an external handler. One such handler might use a database or the file system to store mapping information instead of keeping this information in main memory.</description>
    
    
        <environment></environment>
    
        <key id="10379">RIO-29</key>
        <summary>blank node handling in parsers doesn&apos;t scale</summary>
        <type id="4">Improvement</type>
    
        <priority id="4">Minor</priority>
    
        <status id="1">Open</status>
        
        <resolution>Unresolved</resolution>
        
    
        
        <assignee>Unassigned</assignee>
        
    

    
        
        <reporter username="arjohn">Arjohn Kampman</reporter>
        
    

        
        <created>Wed, 21 Sep 2005 12:54:57 +0200 (CEST)</created>
    <updated>Wed, 21 Sep 2005 12:54:57 +0200 (CEST)</updated>

    
        
        
    

    
        
        
    

    
        
        
            
            
                
                    <component>NTriples parser</component>
                
                    <component>RDF/XML parser</component>
                
                    <component>Turtle parser</component>
                
            
        
    

    
    
        <due></due>
    
    
        <votes>0</votes>
    
    

    
    
    

    




    
    
    

</item>
    
<item>

    







<title>[RIO-47] Reduce memory requirements for verifying uniqueness of rdf:ID attributes</title>
<link>http://www.openrdf.org/issues/browse/RIO-47</link>

    
        <description>Source: &lt;a href=&quot;http://www.openrdf.org/forum/mvnforum/viewthread?thread=967&quot;&gt;http://www.openrdf.org/forum/mvnforum/viewthread?thread=967&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
The RDF/XML specification requires the values of rdf:ID attributes combined with the applicable base URI to be unique within a single RDF/XML document[1]. The RDF/XML parser currently stores these ID-base URI-combinations as URIs in a HashMap. As a result, a considerable amount of memory is needed to be able to parse documents that contain large amounts of rdf:ID attributes, such as the Uniprot benchmark files[2]. As the number of base URIs normally is much smaller than the number of IDs it is possible to reduce the memory requirements by storing the IDs per base URI so that the base URI only needs to be stored once.&lt;br/&gt;
&lt;br/&gt;
[1] &lt;a href=&quot;http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/#section-constraints-id&quot;&gt;http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/#section-constraints-id&lt;/a&gt;&lt;br/&gt;
[2] &lt;a href=&quot;http://expasy3.isb-sib.ch/~ejain//rdf/data-for-benchmarks/&quot;&gt;http://expasy3.isb-sib.ch/~ejain//rdf/data-for-benchmarks/&lt;/a&gt;</description>
    
    
        <environment></environment>
    
        <key id="10530">RIO-47</key>
        <summary>Reduce memory requirements for verifying uniqueness of rdf:ID attributes</summary>
        <type id="4">Improvement</type>
    
        <priority id="4">Minor</priority>
    
        <status id="1">Open</status>
        
        <resolution>Unresolved</resolution>
        
    
        
        <assignee>Unassigned</assignee>
        
    

    
        
        <reporter username="arjohn">Arjohn Kampman</reporter>
        
    

        
        <created>Wed, 14 Jun 2006 12:24:04 +0200 (CEST)</created>
    <updated>Wed, 14 Jun 2006 12:24:04 +0200 (CEST)</updated>

    
        
        
    

    
        
        
    

    
        
        
            
            
                
                    <component>RDF/XML parser</component>
                
            
        
    

    
    
        <due></due>
    
    
        <votes>0</votes>
    
    

    
    
    

    




    
    
    

</item>
    
</channel>
</rss>  

