Possible Duplicate:
How to extract text from the PDF document?
Problem / Application: I am building a system in PHP/Java on a Windows 2008 Server running Apache. The concept is that a user will upload a PDF file. I then want the system to analyze the uploaded PFD file and generate a Title/Description using a algorithm I am going to design. Later my search engine will be able to search through the stored titles/descriptions to find PDF's relavent to the search. This will allow me to search stored PDF files without accessing the PDF's during the search.
What I need is a script or code that converts the PDF to text and store it to an array or something that I can then break down to get what I need.
I've found other threads that use unix/linux command line techniques. However I haven't found any scripts that will allow me to do what I need for Apache servers on Windows.
Any suggestions or alternative techniques I could use for this would be greatly appreciated!