Questions tagged [dmoz]

"Directory Mozilla", also known as the Open Directory Project (ODP).

DMOZ (or the Open Directory Project) is a collaborative effort by volunteers all over the world to maintain a directory of websites organized in a meaningful way.

13 questions
3
votes
1 answer

How to make use of DMOZ RDF data with MySQL and PHP?

I have searched the web but still have not been able to find a solid answer. My goal is to make use of RDF data from DMOZ, so here are 2 questions: How do I import the RDF into MySQL? (Is there any solid tool that could be used?) How do I make the data queryable…
leliao
  • 91
  • 1
  • 3
2
votes
2 answers

How to detect site listing in DMOZ directory from PHP

How can I detect whether a particular site is listed in the DMOZ directory from PHP?
chubbyk
  • 6,212
  • 13
  • 53
  • 67
2
votes
0 answers

Convert dmoz directory to mind map tree

www.dmoz.org provides its comprehensive web categories as an RDF file here: http://www.dmoz.org/rdf.html. I would like to create a mind map (using a tool like FreeMind, for example) or a visual tree from the RDF file at the above link. Is there any easy way for…
T_H
  • 49
  • 4
2
votes
1 answer

Parsing DMOZ dumps for category queries in Python

I am currently working on a project that involves finding the 'domains of knowledge' a certain keyword is related to. I plan to do this using DMOZ. For example, 'Brad Pitt' gives Arts: People: P: Pitt, Brad: Fan Pages (10) Arts: People: P: Pitt,…
srjoglekar
  • 31
  • 4
0
votes
1 answer

Obtain website's description (meta) from ODP DMOZ.org using PHP

I am working on a PHP website. What I want to do is retrieve the description or meta information of websites. I hope to obtain the information from dmoz.org; is there a way of doing such a thing? If not, then what are other…
Louis
  • 1,265
  • 1
  • 15
  • 22
0
votes
1 answer

Scrapy dmoz tutorial: __init__() takes at most 2 arguments (3 given)

PS C:\users\steve\tutorial> scrapy crawl dmoz Traceback (most recent call last): File "c:\python27\scripts\scrapy-script.py", line 9, in load_entry_point('scrapy==1.0.3', 'console_scripts', 'scrapy')() File…
0
votes
0 answers

JAR file fails to load: "Failed to load class"

I am trying to parse a DMOZ file using this code. However, after running the code, the application stops at the beginning, producing the following exceptions: SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder". SLF4J: Defaulting to…
lonesome
  • 2,503
  • 6
  • 35
  • 61
0
votes
1 answer

Exception when parsing structure.rdf.u8, using Jena

Model model = ModelFactory.createDefaultModel(); InputStream in = FileManager.get().open( "W:\\structure.rdf.u8" ); model.read(in, null); model.write(System.out); I use the above code, which is provided in the Jena documentation, to parse the…
lonesome
  • 2,503
  • 6
  • 35
  • 61
0
votes
2 answers

Dmoz/Monster algorithm to calculate the count of each category and subcategory?

I have to create a browse page like monster.com or the DMOZ directory. My problem is the counts for each category. What's the best practice for doing something similar? I am using PHP/MySQL. Thanks!
chahedous
  • 115
  • 3
  • 9
0
votes
0 answers

How to unpack DMOZ URLs from the RDF dump with Python and rdflib?

I tried to open an RDF file (the DMOZ RDF dump), but I get this error message: Traceback (most recent call last): File "/media/_dev_/ODP_RDF_get_links.py", line 4, in result = g.parse("data/content.rdf") File…
user2307154
0
votes
1 answer

Scrapy first tutorial dmoz returning an error "TypeError: Can't use implementer with classes. Use one of the class-declaration functions instead."

Getting an error when running the first tutorial for scrapy. Scrapy : 0.22.2 lxml : 3.3.5.0 libxml2 : 2.7.8 Twisted : 12.0.0 Python : 2.7.2 (default, Oct 11 2012, 20:14:37) - [GCC 4.2.1 Compatible Apple Clang 4.0…
0
votes
3 answers

How to obtain URLs from DMOZ ODP

I want to use a database of the URLs present in DMOZ ODP for my application (an array of URL strings or a file containing the same). Is there any way of obtaining it (other than manual copy-paste)? EDIT: Is there any script/code to parse…
trinity
  • 10,394
  • 15
  • 49
  • 67
0
votes
2 answers

scrapy tutorial: cannot run scrapy crawl dmoz

I'm asking a new question because I'm aware I wasn't clear enough in the last one. I'm trying to follow the Scrapy tutorial, but I'm stuck at the crucial step, the "scrapy crawl dmoz" command. The code is this one (I have written that in the python…
user2668703
  • 21
  • 2
  • 4