5

I want to index a large number of pdf documents.

I have found a reference showing that it could be done using Apache Tika but unfortunately I cannot find any reference that describes I could configure Apache Tika in Solr 1.4.1.

Once configured I do have it configured, how can I send documents to Solr directly without using curl?

I am using solrnet for indexing.

Rob Kielty
  • 7,958
  • 8
  • 39
  • 51
Ahsan Iqbal
  • 1,422
  • 5
  • 20
  • 39

2 Answers2

5

See ExtractingRequestHandler

Pascal Dimassimo
  • 6,908
  • 1
  • 37
  • 34
3

Support for ExtractingRequestHandler in SolrNet is not yet complete. You can either finish implementing it, or work around it and craft your own HttpWebRequests.

Mauricio Scheffer
  • 98,863
  • 23
  • 192
  • 275