Hi all,

I have data which are unfortuantely comtaminated by noise.

We knew that the noise is at different level than the correct data, i.e.
the noise data can be easily picked out by human eyes.

It looks as if there are two people that generated the two very different
data with different mean levels, and they got mixed together.

i.e. assming the two data are following unknown distribution DF,

and the two mean levels are u1 and u2... (unknown)

Then the correct data are generated by DF(u1)

and the noise are generated by DF(u2),

and they got mixed...

Now, how do I flag those suspicious data? At least is there a way I could
answer the question:

Given a sample of mixed data - are these data generated from the
above-mentioned two sources, or the data are indeed generated from one
source only.

i.e. are there two substantially distinct species in the given data?

Thanks a lot!

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to