Polyvore (http://www.polyvore.com) is a well-funded startup company backed
by Benchmark Capital and Matrix Partners. It is a fast-growing site with
over 5 million unique visitors and 120 million pageviews per month. At
Polyvore, we strive to build products that delight people. By design, we
keep th
Hello,
We have an interesting and we guess common machine learning problem.
Polyvore (the company I work for) has a few million products in its index.
For each product we have the following information:
- keywords extracted from title, description, tags
- other meta information like colors, bran
Thanks for your reply. Nested boolean queries is a valid concern. I also
realized that isCoordDisabled needs to be considered in
BooleanQuery.hashCode so that a query with coord=false will have a different
cache key in Solr.
On Thu, Nov 12, 2009 at 12:12 PM, Chris Hostetter
wrote:
>
> : I want
Please ignore my posts. Log is quite expensive an operation...
On Mon, May 11, 2009 at 11:45 AM, Guangwei Yuan wrote:
> After spending more time on this, it seems more likely a problem from
> FunctionQuery. If using boost = log(100) takes 100ms, log(log(100)) adds
> another 100ms, lo
,
Guangwei
On Sat, May 9, 2009 at 12:31 PM, Guangwei Yuan wrote:
> Hi,
>
> I'm trying the BoostQParserPlugin and FunctionQuery to enable query time
> boosting. It works better than bf (boost function) because it multiplies
> the relevancy score by the boosts. However I noticed signi
Hi,
I'm trying the BoostQParserPlugin and FunctionQuery to enable query time
boosting. It works better than bf (boost function) because it multiplies
the relevancy score by the boosts. However I noticed significant
performance issues with it. The more functions I use as boosts, the slower
it ge
Hi,
I've noticed some bad performance in faceted browsing, when the query is
empty (so the MatchAllDocsQuery is used) and there are only filter queries.
An example of the search url is:
http://hostname:8080/solr/select/?q=&qt=dismax&fq=color:%2300
One idea is to switch to the StandardRequest
Hi,
I'm sending a local csv file to Solr via remote streaming, and constantly
get the "500 read timeout" message. The csv file is about 200MB in size, and
Solr is running on Tomcat 5.5. What types of timeout related Tomcat params I
can adjust to fix this?
Thanks in advance.
- Guangwei
Hi,
I'm sending a local csv file to Solr via remote streaming, and constantly
get the "500 read timeout" message. The csv file is about 200MB in size, and
Solr is running on Tomcat 5.5. What types of timeout related Tomcat params I
can adjust to fix this?
Thanks in advance.
- Guangwei
Hi,
Does Solr support empty queries? It'll be nice if Solr can return all
results if q is null. Otherwise, I guess I'll have to write a customized
request handler. Any thoughts?
Thanks in advance.
- Guangwei
>
> can you you explain exactly how you are indexing the data and what your
> query looks like?
>
I used the same field name (color), not 10 different names (c0 - c9).
So the index fields look like (50% #00, 20% #99):
color: #00
color: #00
color: #00
color: #00
color: #000
Thanks for all the replies. I think creating 10 fields and feeding each
field with a color's value for 10% from that color is a reasonable approach,
and easy to implement too. One problem though, is that not all products have
a total of 100% colors (due to various reasons including our color
extrac
Hi,
We're running an e-commerce site that provides product search. We've been
able to extract colors from product images, and we think it'd be cool and
useful to search products by color. A product image can have up to 5 colors
(from a color space of about 100 colors), so we can implement it easil
13 matches
Mail list logo