Any way to index the contents of a PDF file with WebCatalog?
This WebDNA talk-list message is from 2000
It keeps the original formatting.
Hello,

I am trying to work on a catalog of documents type site. The majority of documents are pdf files.

The site will allow people to upload their documents and enter some basic information for sorting purposes. I need a way to index the content of the pdf files and put into a WebCatalog database so I can do free text searches as well.

Does anyone know of any way to get the text portion of a pdf file into a WebCatalog database short of copy and paste (which is not an option for this project).

The end site will be on Solaris (if we can get WebCatalog to stay running for more than a few minutes). I will be doing most of the development on Macintosh (and yes, I am watching case of the filenames). Of course if any part of the process to get the data from the pdf file requires being done on Solaris only all of the development will move to that platform.

Thank you.

--
Dale Therio
Dresdner Kleinwort Benson Research
Jürgen-Ponto-Platz 1
60301 Frankfurt, Germany
+49 69 263 19977 office
+49 69 263 11379 fax
+49 170 934 3610 mobile
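One possible shape for the pipeline described above (extract the text of an uploaded pdf, then store it as a searchable field) is sketched below in Python. It is only a sketch under several assumptions that are not in the original message: that the external `pdftotext` command-line tool (Xpdf or Poppler) is installed on the server, that the target database is a tab-delimited text file whose first line holds the field names, and that the field names `filename`, `title` and `fulltext` are acceptable (they are hypothetical). Tabs and line breaks inside the extracted text are collapsed to single spaces so that each document stays on one record line.

```python
import subprocess
from pathlib import Path


def extract_pdf_text(pdf_path: str) -> str:
    """Return the text of a PDF using the external `pdftotext` tool.

    Assumption: `pdftotext` (Xpdf or Poppler) is installed and on PATH.
    The "-" argument asks it to write the text to standard output.
    """
    result = subprocess.run(
        ["pdftotext", "-enc", "UTF-8", pdf_path, "-"],
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")


def flatten(text: str) -> str:
    """Collapse tabs, newlines and repeated spaces into single spaces."""
    return " ".join(text.split())


def make_record(filename: str, title: str, body: str) -> str:
    """Build one tab-delimited line: filename, title, full text."""
    return "\t".join(flatten(field) for field in (filename, title, body))


def append_record(db_path: str, record: str) -> None:
    """Append a record, writing the header row first if the file is new.

    The field names in the header are hypothetical placeholders.
    """
    path = Path(db_path)
    is_new = not path.exists()
    with path.open("a", encoding="utf-8") as db:
        if is_new:
            db.write("filename\ttitle\tfulltext\n")
        db.write(record + "\n")
```

A caller would combine the pieces roughly as `append_record("docs.db", make_record(name, title, extract_pdf_text(path)))`, run once per uploaded file. Scanned pdf files that contain only images would yield no text with this approach.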
Associated Messages, from the most recent to the oldest:
- Any way to index the contents of a PDF file with WebCatalog? (dale@gmr.dresdner.net 2000)