If you are a Java programmer, you may want to look at plugging your own
custom Streams into the Streaming API. The SQL support is built on top of the
Streaming API.

http://joelsolr.blogspot.com/2015/04/the-streaming-api-solrjio-basics.html

Joel Bernstein
http://joelsolr.blogspot.com/

On Fri, Jun 12, 2015 at 11:00 AM, Joel Bernstein <joels...@gmail.com> wrote:

> https://issues.apache.org/jira/browse/SOLR-7560 will almost support this
> in Solr 5.3. Compound function support won't be there yet, but it will be
> there in the near future.
>
>
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Fri, Jun 12, 2015 at 9:30 AM, kingofhypocrites <
> kingofhypocri...@gmail.com> wrote:
>
>> I am migrating a database from SQL Server to Cassandra. Currently I have a
>> setup as follows:
>>
>> - Log data in Cassandra
>> - Summarize data in Spark and put into Cassandra summary tables
>> - Query data in Solr
>>
>> Everything fits beautifully until I need to do stats on groups. I am hoping
>> to get this to work with Solr so I can stick to one database, but I am not
>> sure it's possible.
>>
>> If I had it in SQL Server, I could do it like so:
>> SELECT
>>     site_id,
>>     keyword,
>>     SUM(visits) as visits,
>>     CONVERT(DECIMAL(13, 3), SUM(bounces)) / SUM(visits) as bounce_rate,
>>     SUM(pageviews) as pageviews,
>>     CONVERT(DECIMAL(13, 3), SUM(pageviews)) / SUM(visits) as avg_pages_per_visit
>> FROM
>>     report_all_keywords_daily
>> WHERE
>>     site_id = 55 AND date_key >= '20150606' AND date_key <= '20150608'
>> GROUP BY
>>     site_id, keyword
>> ORDER BY visits DESC
>>
>> Now I need to replicate this in Solr. The closest I could get to this is by
>> using the Stats component and then using field collapsing.
>>
>> group=true&group.field=keyword&stats=true&stats.field=visits&stats.facet=keyword
>>
>> And here are some results I get back:
>> http://pastebin.com/raw.php?i=Fxhe2RA0
>>
>> However, I need to be able to divide certain metrics. I tried including
>> functions in the stats.field such as div(sum(bounce_rate), sum(visits)), but
>> it doesn't recognize the functions. Also, it seems to be ignoring the paging
>> for the stats results and returns all groups regardless.
>>
>> Ultimately I'd like something like this which is what I would get in SQL:
>> <http://lucene.472066.n3.nabble.com/file/n4211402/pic.png>
>>
>> Is this possible or do I have to give up on the prospect of using Solr? I
>> have to query this data dynamically so I can't pre-summarize all of it.
>>
>> To clarify, I have the following two problems:
>> - Paging is ignored for stats data
>> - I can't figure out how to divide one stat by another to get a third stat.
>> Note: In some cases I would need to be able to sort on this combined stat
>>
>>
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/Division-with-Stats-Component-when-Grouping-in-Solr-tp4211402.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>
>
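
For reference, the arithmetic behind the SQL query quoted above (sum per
keyword, divide the sums, sort by visits, then page) can be sketched in plain
Java. This is only an illustration of the computation, not Solr or Streaming
API code: the class KeywordStats, its Row and Summary types, and the
summarize method are hypothetical names, and the rows are assumed to have
already been fetched and filtered by site_id and date_key.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper: client-side GROUP BY keyword with derived ratios. */
public class KeywordStats {

    /** One input row (assumed already filtered by site_id and date range). */
    public static final class Row {
        final String keyword;
        final long visits, bounces, pageviews;

        public Row(String keyword, long visits, long bounces, long pageviews) {
            this.keyword = keyword;
            this.visits = visits;
            this.bounces = bounces;
            this.pageviews = pageviews;
        }
    }

    /** One output row per keyword. */
    public static final class Summary {
        public final String keyword;
        public final long visits, pageviews;
        public final double bounceRate, avgPagesPerVisit;

        Summary(String keyword, long visits, long bounces, long pageviews) {
            this.keyword = keyword;
            this.visits = visits;
            this.pageviews = pageviews;
            // Divide the sums, not the per-row ratios; guard against zero visits.
            this.bounceRate = visits == 0 ? 0.0 : (double) bounces / visits;
            this.avgPagesPerVisit = visits == 0 ? 0.0 : (double) pageviews / visits;
        }
    }

    /** Groups by keyword, sorts by visits descending, returns one page. */
    public static List<Summary> summarize(List<Row> rows, int offset, int limit) {
        // keyword -> {sum(visits), sum(bounces), sum(pageviews)}
        Map<String, long[]> sums = new LinkedHashMap<>();
        for (Row r : rows) {
            long[] s = sums.computeIfAbsent(r.keyword, k -> new long[3]);
            s[0] += r.visits;
            s[1] += r.bounces;
            s[2] += r.pageviews;
        }

        List<Summary> all = new ArrayList<>();
        for (Map.Entry<String, long[]> e : sums.entrySet()) {
            long[] s = e.getValue();
            all.add(new Summary(e.getKey(), s[0], s[1], s[2]));
        }

        // ORDER BY visits DESC; a different comparator could sort on a ratio.
        all.sort((a, b) -> Long.compare(b.visits, a.visits));

        // Paging is applied after sorting, so it covers the grouped results.
        int from = Math.min(Math.max(offset, 0), all.size());
        int to = Math.min(from + Math.max(limit, 0), all.size());
        return new ArrayList<>(all.subList(from, to));
    }
}
```

Calling KeywordStats.summarize(rows, 0, 10) would return the first ten
keywords by total visits. Doing this on the client only works when the
matching rows fit in memory, which is the part a server-side approach would
avoid.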
