Re: [R] implementing Maximum Likelihood with distrMod when only the PDF is known

Matthias Kohl Wed, 24 Jun 2009 03:32:46 -0700

Dear Guillaume,

retrospectively, I'm not sure if the decision to have spezial initializemethods is the optimal way to go. In distrMod and our other packages onrobust statistics we don't introduce special initialize methods, but useso-called generating functions. This approach has the advantage that thedefault initialize method can be used for programming where I find itvery useful.

I would say it is the canonical (and recommended) way to first define afunction as generic (like mu) and then implement methods for it.

Choosing names for which there is already an existing definition for ageneric function may also not be what you want in general. By definingnew methods you have to respect the definition of the generic function;that is, your method definition has to be with respect to the argumentsin the given generic function and also dispatching will be on thesearguments (cf. scale in the distrMod example).

In defining our classes we decided that at least the slot "r" has to befilled (of class "function") whereas d, p and q are of class"OptionalFunction". Hence, there are functions to fill d, p and q forgiven r but, so far not for filling r, p and q given d.


A way to avoid implementing r is given in
http://www.stamats.de/distrModExample1.R

I also do not fill the slots p and q in this example. This avoids thesimulation of a large random sample and speeds up the computation of theMLE.

However, this is rather a quick and dirty solution and it would ofcourse be better to have a valid definition for r, d, p and q.


Best,
Matthias


guillaume.martin schrieb:

Dear Mathias,
That's pretty amazing, thanks a lot ! I'll have to look all thisthrough because I don't easily understand why each part has to be setup, in particular the "initialize method" part seems crucial and isnot easy to intuit. From what I get, the actual name we give to aparameter (my original "mu" for example) is important in itself, andif we introduce new variable names we have to define a new generic,right? The simplest option then is to re-use an existing variable namethat has the same properties/range, right?
Another general question: my actual pdf is of the same type but notthe exact same as the skew normal. In particular, I don't have a rulefor building the slot r (eg the one borrowed from the sn package inyour example); is it a problem? isn't it sufficient to give slot d,and then you have automatic methods implemented to get from d() to r()slots etc. is that right?
Thanks a lot for your help and time !

Best,

Guillaume


Matthias Kohl a écrit :
Dear Guillaume,

thanks for your interest in the distrMod package.

Regarding your question I took up your example and put a file under:

http://www.stamats.de/distrModExample.R

Hope that helps ...

Don't hesitate to contact me if you have further questions!

Best,
Matthias

guillaume.martin schrieb:
Dear R users and Dear authors of the distr package and sequels
I am trying to use the (very nice) package distrMod as I want toimplement maximum likelihood (ML) fit of some univariate data forwhich I have derived a theoretical continuous density (pdf). As itis a parametric density, I guess that I should implement myself anew distribution of class AbscontDistributions (as stated in the pdfon "creating new distributions in distr"), and then useMLEstimator() from the distrMod package. Is that correct or is therea simpler way to go? Note that I want to use the distr packagebecause it allows me to implement simply the convolution of mytheoretical pdf with some noise distribution that I want to model inthe data, this is more difficult with fitdistr or mle.It proved rather difficult for me to implement the new classfollowing all the steps provided in the example, so I am asking ifsomeone has an example of code he wrote to implement a parametricdistribution from its pdf alone and then used it with MLEstimator().
I am sorry for the post is a bit long but it is a complicatequestion to me and I am not at all skillful in the handling of suchnotions as "S4 - class", etc.. so I am a bit lost here..
As a simple example, suppose my theoretical pdf is the skew normaldistribution (available in package sn):
#skew normal pdf (default values = the standard normal N(0,1)
fsn<-function(x,mu=0,sd=1,d=0) {u = (x-mu)/sd; f =dnorm(u)*pnorm(d*u); return(f/sd)}
# d = shape parameter (any real), mu = location (any real), sd =scale (positive real)
#to see what it looks like try
x<-seq(-1,4,length=200);plot(fsn(x,d=3),type="l")
#Now I tried to create the classes "SkewNorm" and"SkewNormParameter" copying the example for the binomial
##Class:parameters
setClass("SkewNormParameter",
representation=representation(mu="numeric",sd="numeric",d="numeric"),
prototype=prototype(mu=0,sd=1,d=0,name=gettext("Parameter of theSkew Normal distribution")),
contains="Parameter"
)
##Class: distribution (created using the pdf of the skew normaldefined above)
setClass("SkewNorm",prototype = prototype(
    d = function(x, log = FALSE){fsn(x, mu=0, sd=1,d=0)},
    param = new("SkewNormParameter"),
    .logExact = TRUE,.lowerExact = TRUE),
contains = "AbscontDistribution"
)

#so far so good but then with
setMethod("mu", "SkewNormParameter", function(object) obj...@mu)

#I get the following error message:
> Error in setMethod("mu", "SkewNormParameter", function(object)obj...@mu) : no existing definition for function "mu"
I don't understand because to me mu is a parameter not a function...maybe that is too complex programming for me and I should switch toimplementing my likelihood by hand with numerical convolutions andoptim() etc., but I would like to know how to use distr, so if thereis anyone who had the same problem and solved it, I would be verygrateful for the hint !
All the best,
Guillaume


--
Dr. Matthias Kohl
www.stamats.de

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] implementing Maximum Likelihood with distrMod when only the PDF is known

Reply via email to