[ 
https://issues.apache.org/jira/browse/PIO-45?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15875117#comment-15875117
 ] 

Pat Ferrel commented on PIO-45:
-------------------------------

Yeah, sorry, it was a bad example but does follow the rules above. These rules 
are built into PIO for aggregated queries so refer to actual behavior when in 
doubt.

Also there seems to be a problem where one event in the test dataset that 
should not be dropped is, I'm still tracking this down.

The timestamp on any de-duped event should be the most recent of the dups. 
Otherwise 2 trims may drop an event that shouldn't be. I haven't verified the 
correct behavior here but since we are close to a release I thought I'd bring 
it up while I verify.

To the best of my knowledge the README.md of the Template describes how PIO 
should or does work.

> SelfCleaningDatasource erases all data
> --------------------------------------
>
>                 Key: PIO-45
>                 URL: https://issues.apache.org/jira/browse/PIO-45
>             Project: PredictionIO
>          Issue Type: Bug
>    Affects Versions: 0.10.0-incubating
>            Reporter: Pat Ferrel
>            Assignee: Alexander  Merritt
>            Priority: Blocker
>             Fix For: 0.11.0
>
>         Attachments: import_handmade_simple.py, 
> sample-time-window-and-downsample-data.txt
>
>
> as integrated into the UR, in the integration-test, the SelfCleaningDataset 
> erases all data. This feature works fine in the AML version of PIO.
> Although not tested one could assume that this would be true with any other 
> Datasource in other templates.
> [~emergentorder] can you check to see if the PIO merge was done correctly.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to