Hi all, I have data which are unfortuantely comtaminated by noise.
We knew that the noise is at different level than the correct data, i.e. the noise data can be easily picked out by human eyes. It looks as if there are two people that generated the two very different data with different mean levels, and they got mixed together. i.e. assming the two data are following unknown distribution DF, and the two mean levels are u1 and u2... (unknown) Then the correct data are generated by DF(u1) and the noise are generated by DF(u2), and they got mixed... Now, how do I flag those suspicious data? At least is there a way I could answer the question: Given a sample of mixed data - are these data generated from the above-mentioned two sources, or the data are indeed generated from one source only. i.e. are there two substantially distinct species in the given data? Thanks a lot! [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.