I have a Sqlserver database with blobs
Some of the blobs are images, but the majority are MSWord documents or pdf documents
I am looking for a solution to search text (keyphrases) within these blobs
A possible solution could be a utility that would extract the blobs to a temporary directory during night time, then we could use Index server, google desktop or similar component to build an index of keywords present in the text and we would populate appropriate new tables in Sqlserver
A commercial programs is an other acceptable solution (we can buy a licence). I would need help to put it in place.
We are using Sqlserver2000 - no immediate plans to upgrade
Any programming must be done in either TSql or VBnet. We prefer MS components (I am very reluctant to use MySql for example)
The solution must be Sqlserver compatible (ie: it must be easy to call it from Sqlserver so as to integrate the search with other functions/fields...) a web service could be acceptable too
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
VBnet /TSql