[foaf-dev] [foaf-protocols] FOAF sites offline during cleanup

Kingsley Idehen kidehen at openlinksw.com
Wed Apr 29 15:57:01 CEST 2009

Steve Harris wrote:
> On 29 Apr 2009, at 14:06, Kingsley Idehen wrote:
>> Steve,
>> If we isolate the "FOAF Profiles" bubble of the LOD-Cloud pictorial, 
>> would you say these sources are representative:
>> 1. http://esw.w3.org/topic/FoafSites
>> 2. http://pingthesemanticweb.com  (PTSW)
>> 3. http://sindice.com
> It seems highly unlikely.
> The only way to get a representative sample is to select some of the 
> data randomly. 

So I end this thread by asking: isn't that basically what we have in our 
instance? Its data comes from the sources above plus others.
> ESW links a human-curated selection of sites, PTSW gets fed similarly 
> similarly and Sindice crawled, IIUC.
> I don't think anyone even has a good idea of how many FOAF files are 
> out there, to know if they have a good selection or not. I think we 
> have 12 million or so unique ones, but we know there's an awful lot 
> more out there.
> Ontop of that, "FOAF" is especially vague, eg. do qdos.com profiles 
> (eg. 
> http://qdos.com/user/Steve-Harris/18b6f60b41e05aaa418565ebfe901d6b/turtle) count 
> as FOAF profiles? They have foaf:People in them, and use one or two 
> foaf properties, but foaf: is not the most common prefix.
> What about DOAP files with lots of FOAF in them? Some use foaf: more 
> than doap:, and so on.
DOAP files are picked up from PTSW and a few other data sets that use FOAF.

Maybe we chat by phone of private IM (IRC, Twitter, Identi.ca etc about 
this) ?

> - Steve



Kingsley Idehen	      Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO 
OpenLink Software     Web: http://www.openlinksw.com

More information about the foaf-dev mailing list