[
https://issues.apache.org/jira/browse/ATLAS-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16049305#comment-16049305
]
Graham Wallis commented on ATLAS-1868:
--------------------------------------
Hi Christian,

I'm afraid I don't know the answer to the question about 0.8 in your last
paragraph. I have only been involved with Atlas for a few weeks and am still
finding my way around :-) I will post back here if I find anything that might
explain it.

Regarding your suggestion about the Solr/Elasticsearch indexes, I think they
are only used for full-text searches. To optimize your original query, I think
we should instead try to exploit a composite index and start the traversal
from the indexed vertex. A rarity score from the full-text index might be an
alternative way to indicate the better of the two directions for traversing.

I'm currently working through the various layers of graph abstraction to find
where in Atlas either optimization would be implemented. Up till now I have
been too close to the graph: by that stage the operations are lower-level,
discrete lookups of vertices or edges. I think I need to look further up and
find where the whole query is composed.
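To illustrate the direction-of-traversal idea, here is a minimal sketch in
Python. It is not Atlas code: TinyGraph, forward, reverse and choose_and_run
are hypothetical names, and the in-memory dictionary standing in for a
composite index is an assumption made purely for illustration. It counts how
many vertices each plan touches and picks the starting side whose index
lookup returns fewer vertices.

```python
from collections import defaultdict


class TinyGraph:
    """Toy in-memory graph; the 'index' dict stands in for a composite index."""

    def __init__(self):
        self.props = {}                      # vertex id -> property dict
        self.out_edges = defaultdict(list)   # (source id, label) -> target ids
        self.in_edges = defaultdict(list)    # (target id, label) -> source ids
        self.index = defaultdict(set)        # (key, value) -> vertex ids

    def add_vertex(self, vid, **props):
        self.props[vid] = props
        for key, value in props.items():
            self.index[(key, value)].add(vid)

    def add_edge(self, src, label, dst):
        self.out_edges[(src, label)].append(dst)
        self.in_edges[(dst, label)].append(src)


def forward(g, typename, label, key, value):
    """Start from every vertex of the type, follow out-edges, then filter."""
    touched, result = 0, set()
    for v in g.index.get(("typename", typename), ()):
        touched += 1
        for w in g.out_edges.get((v, label), ()):
            touched += 1
            if g.props[w].get(key) == value:
                result.add(v)
    return result, touched


def reverse(g, typename, label, key, value):
    """Start from the vertices matching the property, follow in-edges back."""
    touched, result = 0, set()
    for w in g.index.get((key, value), ()):
        touched += 1
        for v in g.in_edges.get((w, label), ()):
            touched += 1
            if g.props[v].get("typename") == typename:
                result.add(v)
    return result, touched


def choose_and_run(g, typename, label, key, value):
    """Pick the plan whose starting index lookup yields fewer vertices."""
    n_type = len(g.index.get(("typename", typename), ()))
    n_value = len(g.index.get((key, value), ()))
    plan = reverse if n_value <= n_type else forward
    return plan.__name__, plan(g, typename, label, key, value)


def build_sample(n):
    """n 'mytype' vertices, each linked to one property vertex with a unique id."""
    g = TinyGraph()
    for i in range(n):
        g.add_vertex(f"e{i}", typename="mytype")
        g.add_vertex(f"p{i}", typename="proptype", id=f"id{i}")
        g.add_edge(f"e{i}", "property", f"p{i}")
    return g
```

Both plans return the same vertices; the difference is only in how much of
the graph each one has to visit, which is the gap I would expect a
cardinality-aware choice of starting vertex to close.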
> Highly inefficient DSL-queries
> ------------------------------
>
> Key: ATLAS-1868
> URL: https://issues.apache.org/jira/browse/ATLAS-1868
> Project: Atlas
> Issue Type: Bug
> Components: atlas-core
> Affects Versions: 0.7-incubating
> Environment: linux, hbase + solr configuration.
> Reporter: Christian R
> Labels: dsl, gremlin
>
> The DSL query 'mytype where property.id = "id1"' appears to be rewritten as a
> gremlin query that resembles:
> g.V.has('typename', 'mytype').as('x').out('property').has('id', 'id1').back('x')
> On our system this query takes 6-7 minutes. The query
> g.V.has('id', 'id1').in('property').has('typename', 'mytype')
> takes 350 milliseconds.
> Our graph:
> g.V.count() = 1359151
> We have Atlas 0.7 installed. I've compiled the latest 0.9 code and looked at
> the generated gremlin query as reported in the logs for the same DSL-query,
> and I think 0.9 has the same performance issues. Unfortunately I don't have a
> big graph on a 0.9 installation to test performance.