It sounds like you want a data warehouse, not a text search engine.
Splunk and Pentaho are good things to try.
On Thu, Apr 29, 2010 at 12:03 PM, Jon Baer wrote:
> To follow up it ... it seems dumping to Solr is common ...
>
> http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-
To follow up it ... it seems dumping to Solr is common ...
http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
- Jon
On Apr 29, 2010, at 1:58 PM, Jon Baer wrote:
> Good question, +1 on finding answer, my take ...
>
> Depending on how large of log files y
Good question, +1 on finding answer, my take ...
Depending on how large of log files you are talking about it might be better
off to do this w/ HDFS / Hadoop (and a script language like Pig) (or Amazon EMR)
http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873
Theoretically y