Using Lucene Properties for Query Properties on Windows
I am planning to use Apache lucense in one of my projects, I want to index files based on file properties (I will not index data), and I want lucense to query the index so that I can quickly find a list of files based on properties.
For example: give me all files with access times greater than 10/10/2005 and access times less than 10/04/2010 and a file generated by james.
Can I use Lucene for projects like this? or am I better off using windows search (fake fingerprint is very heavy almost 5MB :() and I have to bundle this as part of my application, it seems difficult.
Can you suggest if there are alternatives here?
Is it possible to use apache POI or Apache Tika so I don't have to do a lot of processing?
a source to share
Lucene is definitely a valid option for indexing file properties, I have done something very similar in the past (search for images based on image properties).
I'm a bit concerned about getting properties. Are you planning on using the API in the File class? Looking around quickly, I found the APIs to be very limited; as a result, you probably get those properties elsewhere and store them in the staging environment.
An alternative to Lucene is Sphinx, it seems to be easier (based on my experience and observation, Lucene is better for large datasets in the millionth range). I've never worked with the Sphinx, but I've heard good things. It might be worth exploring before committing.
a source to share