[rdfweb-dev] The toolset issue

Julian Bond julian_bond at voidstar.com
Tue Jun 8 09:47:20 UTC 2004


I did some digging around in RAP and Morten's code last night. If we're 
looking at including it in something like Drupal then we can chuck out 
RDQL support, all the database stuff, all the N3 stuff and the RDF to 
XML serialisers. That gets you down to 13 files and 144Kb. It's a matter 
of minutes to combine what's left into two files. A few more minutes to 
remove all the copious comments and it's two files and 120Kb. That's not 
so bad. RAP's only getting version updates every 6 months or so, so 
staying in step wouldn't be hard.

In this case, the important timings in Morten's work are 25 secs for 
full RAP and 17 secs for Morten's cut down version. The 25% difference 
is all in replacing an array of triples as objects against a simple 
array of triples as arrays. But in the process you also chuck out the 
RAP API which means considerably more code to actually do useful work 
with the results. The remaining 17 secs vs 2.7 secs for Redland is 
entirely basic RDF parsing in PHP vs C. At this point, we're stuck with 
either trying to completely re-write the RDF parser, switching to 
Redland or just putting up with the performance. There maybe some 
performance to be had in optimising RAP's basic parser but that's beyond 
me.

I've got some big FOAF files so I'll try and get some real world timing 
on how long RAP takes to build the triple array and index it.

-- 
Julian Bond Email&MSM: julian.bond at voidstar.com
Webmaster:                 http://www.ecademy.com/
Personal WebLog:          http://www.voidstar.com/
M: +44 (0)77 5907 2173      T: +44 (0)192 0412 433



More information about the foaf-dev mailing list