Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

Duncan Murdoch Sat, 10 Jul 2010 17:42:29 -0700

On 10/07/2010 7:32 PM, Ravi Varadhan wrote:

Hi,


The best solution would be to identify where the problem is in the FORTRAN code 
and correct it.  However, this problem of premature termination due to absolute 
function convergence is highly unlikely to occur in practice.  As John Nash noted, 
this is going to be highly unlikely for multi-dimensional parameters (it is also 
unlikely for one-dimensional problem).  However, unless we understand the source 
of the problem, we cannot feel comfortable in saying with absolute certainty that 
this will not occur for n > 1.  Therefore, I would suggest that either we fix 
the problem at its source or we set abs.tol=0, since there is little harm in doing 
so.

Just for future reference: that's not the kind of answer that leads toanything getting done. So I'll leave it to the authors of nlminb.


Duncan Murdoch

Ravi.

____________________________________________________________________

Ravi Varadhan, Ph.D.
Assistant Professor,
Division of Geriatric Medicine and Gerontology
School of Medicine
Johns Hopkins University

Ph. (410) 502-2619
email: rvarad...@jhmi.edu


----- Original Message -----
From: Duncan Murdoch <murdoch.dun...@gmail.com>
Date: Saturday, July 10, 2010 7:32 am
Subject: Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)
To: Ravi Varadhan <rvarad...@jhmi.edu>
Cc: Matthew Killeya <matthewkill...@googlemail.com>, Peter Ehlers 
<ehl...@ucalgary.ca>, r-help@r-project.org, ba...@stat.wisc.edu
Ravi Varadhan wrote:
 >Hi,
 >
>The absolute function stopping criterion is not meant for anypositive objective function. It is meant for functions whose minimumis 0. Here is what David Gay's documentation from PORT says:
 >
>"6 - absolute function convergence: |f (x)| < V(AFCTOL) = V(31).This test is only of interest in>problems where f (x) = 0 means a ‘‘perfect fit’’, such as nonlinearleast-squares problems.">Okay, I've taken a more careful look at the docs, and they do saythat the return code 6 does not necessarily indicate convergence:"The desirable return codes are 3, 4, 5, and sometimes 6". So weshouldn't by default terminate on it, we should allow users to choosethat if they want faster convergence to perfect fits.Would changing the default for abs.tol to zero be a reasonable solution?Duncan Murdoch
 >For example, let us try a positive objective function:
 >
>>>nlminb( obj = function(x) x^2 + 1, start=1, lower=-Inf, upper=Inf,control=list(trace=TRUE))> 0: 2.0000000: 1.00000
 >  1:     1.0000000:  0.00000
 >  2:     1.0000000:  0.00000
 >$par
 >[1] 0
 >
 >$objective
 >[1] 1
 >
 >$convergence
 >[1] 0
 >
 >$message
 >[1] "relative convergence (4)"
 >
 >$iterations
 >[1] 2
 >
 >$evaluations
>function gradient 3 2>>Here the absolute function criterion does not kicks in.>
 >Now let us try a function whose minimum value is 0.
 >
>>>nlminb( obj = function(x) x^2, start=6, grad=function(x) 2*x,lower=-Inf, upper=Inf, control=list(trace=TRUE) )>>> 0: 36.000000: 6.00000
 >  1:     4.0000000:  2.00000
 >  2: 4.9303807e-32: 2.22045e-16
 >$par
 >[1] 2.220446e-16
 >
 >$objective
 >[1] 4.930381e-32
 >
 >$convergence
 >[1] 0
 >
 >$message
 >[1] "absolute function convergence (6)"
 >
 >$iterations
 >[1] 2
 >
 >$evaluations
>function gradient 4 3>We see that convergence is attained and that the stoppage is due toabsolute function criterion.>Suppose, we now set abs.tol=0:
 >
>>>nlminb( obj = function(x) x^2, start=6, grad=function(x) 2*x,lower=-Inf, upper=Inf, control=list(trace=TRUE, abs.tol=0) )>>> 0: 36.000000: 6.00000
 >  1:     4.0000000:  2.00000
 >  2: 4.9303807e-32: 2.22045e-16
 >  3: 2.4308653e-63: -4.93038e-32
 >  4: 2.9962729e-95: -5.47382e-48
 >  5:1.4772766e-126: 1.21543e-63
 >  6:1.8208840e-158: 1.34940e-79
 >  7:8.9776511e-190: -2.99627e-95
 >  8:1.1065809e-221: -3.32653e-111
 >  9:5.4558652e-253: 7.38638e-127
 > 10:6.7248731e-285: 8.20053e-143
 > 11:3.3156184e-316: -1.82088e-158
 > 12:     0.0000000: -2.02159e-174
 > 13:     0.0000000: -2.02159e-174
 >$par
 >[1] -2.021587e-174
 >
 >$objective
 >[1] 0
 >
 >$convergence
 >[1] 0
 >
 >$message
 >[1] "X-convergence (3)"
 >
 >$iterations
 >[1] 13
 >
 >$evaluations
>function gradient 15 13> Now, we see that it takes a while to stop, eventhough it is clearthat convergence has been attained after 2 iterations. Thisdemonstrates the need for the absolute function criterion for objfunctions whose minimum is exactly 0. Although, there is nothingwrong with setting abs.tol=0, except for some loss of computationalefficiency.>Now, let us get back to Matthew' example:
 >
>>>nlminb( obj = function(x) x, start=1, lower=-2, upper=2,control=list(trace=TRUE))> 0: 1.0000000: 1.00000
 >  1:     0.0000000:  0.00000
 >$par
 >[1] 0
 >
 >$objective
 >[1] 0
 >
 >$convergence
 >[1] 0
 >
 >$message
 >[1] "absolute function convergence (6)"
 >
 >$iterations
 >[1] 1
 >
 >$evaluations
>function gradient 2 2>>>nlminb( obj = function(x) x, start=1, lower=-2, upper=2,control=list(trace=TRUE, abs.tol=0))> 0: 1.0000000: 1.00000
 >  1:     0.0000000:  0.00000
 >  2:    -2.0000000: -2.00000
 >  3:    -2.0000000: -2.00000
 >$par
 >[1] -2
 >
 >$objective
 >[1] -2
 >
 >$convergence
 >[1] 0
 >
 >$message
 >[1] "both X-convergence and relative convergence (5)"
 >
 >$iterations
 >[1] 3
 >
 >$evaluations
>function gradient 3 3>>Thus it is evident that setting abs.tol=0 is a reasonable, generalsolution for functions whose minimum value is non-zero, because itprotects against premature termination at iteration `n' whenever|f(x_n)| < abs.tol. The only limitation being that of loss ofefficiency in problems where f(x*) = 0. where x* is the local minimum.
 >
 >Ravi.
 >____________________________________________________________________
 >
 >Ravi Varadhan, Ph.D.
 >Assistant Professor,
 >Division of Geriatric Medicine and Gerontology
 >School of Medicine
 >Johns Hopkins University
 >
 >Ph. (410) 502-2619
 >email: rvarad...@jhmi.edu
 >
 >
 >----- Original Message -----
 >From: Duncan Murdoch <murdoch.dun...@gmail.com>
 >Date: Friday, July 9, 2010 6:54 pm
>Subject: Re: [R] Not nice behaviour of nlminb (windows 32 bit,version 2.11.1)
 >To: Matthew Killeya <matthewkill...@googlemail.com>
>Cc: Peter Ehlers <ehl...@ucalgary.ca>, Ravi Varadhan<rvarad...@jhmi.edu>, r-help@r-project.org, ba...@stat.wisc.edu
 >
 >
>>>On 09/07/2010 6:09 PM, Matthew Killeya wrote:>> >Yes clearly a bug... there are numerous variations ... problemseems to be
 >> >for a linear function whenever the first function valuation is 1.
>> > Not at all. You can get the same problem on a quadratic thathappens to have a zero at an inconvenient place, e.g.>> nlminb( obj = function(x) x^2-25, start=6, lower=-Inf, upper=Inf)>> Ravi's workaround of setting the abs.tol to zero fixes thisexample, but I think it's pretty clear from the documentation that thewhole thing was designed for positive objective functions, so Iwouldn't count on his workaround solving all the problems.
 >>  Duncan Murdoch
 >>   >e.g. two more examples:
 >> > nlminb( obj = function(x) x+1, start=0, lower=-Inf, upper=Inf )
 >> > nlminb( obj = function(x) x+2, start=-1, lower=-Inf, upper=Inf )
 >> >
 >> >(I wasn't sure where best to report a bug, so emailed the help list)
 >> >
 >> >On 9 July 2010 22:10, Peter Ehlers <ehl...@ucalgary.ca> wrote:
 >> >
 >> >   >>Actually, it looks like any value other than 1.0
 >> >>(and in (lower, upper)) for start will work.
 >> >>
 >> >> -Peter Ehlers
 >> >>
 >> >>
 >> >>On 2010-07-09 14:45, Ravi Varadhan wrote:
 >> >>
>> >> >>>Setting abs.tol = 0 works! This turns-off the absolutefunction
 >> >>>convergence
 >> >>>criterion.
 >> >>>
 >> >>>
 >> >>> nlminb( objective=function(x) x, start=1, lower=-2, upper=2,
 >> >>>      control=list(abs.tol=0))
 >> >>>$par
 >> >>>[1] -2
 >> >>>
 >> >>>$objective
 >> >>>[1] -2
 >> >>>
 >> >>>$convergence
 >> >>>[1] 0
 >> >>>
 >> >>>$message
 >> >>>[1] "both X-convergence and relative convergence (5)"
 >> >>>
 >> >>>$iterations
 >> >>>[1] 3
 >> >>>
 >> >>>$evaluations
 >> >>>function gradient
 >> >>>       3        3
 >> >>>
 >> >>>
 >> >>>This is clearly a bug.
 >> >>>
 >> >>>
 >> >>>Ravi.
 >> >>>
 >> >>>-----Original Message-----
 >> >>>From: r-help-boun...@r-project.org [
 >> >>>On
 >> >>>Behalf Of Ravi Varadhan
 >> >>>Sent: Friday, July 09, 2010 4:42 PM
 >> >>>To: 'Duncan Murdoch'; 'Matthew Killeya'
 >> >>>Cc: r-help@r-project.org; ba...@stat.wisc.edu
>> >>>Subject: Re: [R] Not nice behaviour of nlminb (windows 32 bit,version
 >> >>>2.11.1)
 >> >>>
>> >>>Duncan, `nlminb' is not intended for non-negative functionsonly. There
 >> >>>is
 >> >>>indeed something strange happening in the algorithm!
 >> >>>
 >> >>>start<- 1.0 # converges to wrong minimum
 >> >>>
 >> >>>startp<- 1.0 + .Machine$double.eps  # correct
 >> >>>
 >> >>>startm<- 1.0 - .Machine$double.eps  # correct
 >> >>>
 >> >>> nlminb( objective=obj, start=start, lower=-2, upper=2)
 >> >>>      $par
 >> >>>[1] 0
 >> >>>
 >> >>>$objective
 >> >>>[1] 0
 >> >>>
 >> >>>$convergence
 >> >>>[1] 0
 >> >>>
 >> >>>$message
 >> >>>[1] "absolute function convergence (6)"
 >> >>>
 >> >>>$iterations
 >> >>>[1] 1
 >> >>>
 >> >>>$evaluations
 >> >>>function gradient
 >> >>>       2        2
 >> >>>
 >> >>>
 >> >>>       >>>>nlminb( objective=obj, start=startp, lower=-2, upper=2)
 >> >>>>
 >> >>>>         >>>$par
 >> >>>[1] -2
 >> >>>
 >> >>>$objective
 >> >>>[1] -2
 >> >>>
 >> >>>$convergence
 >> >>>[1] 0
 >> >>>
 >> >>>$message
 >> >>>[1] "both X-convergence and relative convergence (5)"
 >> >>>
 >> >>>$iterations
 >> >>>[1] 3
 >> >>>
 >> >>>$evaluations
 >> >>>function gradient
 >> >>>       3        3
 >> >>>
 >> >>>
 >> >>>       >>>>nlminb( objective=obj, start=startm, lower=-2, upper=2)
 >> >>>>
 >> >>>>         >>>$par
 >> >>>[1] -2
 >> >>>
 >> >>>$objective
 >> >>>[1] -2
 >> >>>
 >> >>>$convergence
 >> >>>[1] 0
 >> >>>
 >> >>>$message
 >> >>>[1] "both X-convergence and relative convergence (5)"
 >> >>>
 >> >>>$iterations
 >> >>>[1] 3
 >> >>>
 >> >>>$evaluations
 >> >>>function gradient
 >> >>>       3        3
 >> >>>
 >> >>>
>> >>> From the convergence message the `absolute functionconvergence' seems to
 >> >>>      be
>> >>>the culprit, although I do not understand why that stoppingcriterion is>> >>>becoming effective, when the algorithm is started at x=1, butnot at any
 >> >>>other values.  The documentation in IPORT makes it clear that this
 >> >>>criterion
>> >>>is effective only for functions where f(x*) = 0, where x* is alocal>> >>>minimum. In this example, x=0 is not a local minimum for f(x),so that
 >> >>>criterion should not apply.
 >> >>>
 >> >>>
 >> >>>Ravi.
 >> >>>
 >> >>>
 >> >>>-----Original Message-----
 >> >>>From: r-help-boun...@r-project.org [
 >> >>>On
 >> >>>Behalf Of Duncan Murdoch
 >> >>>Sent: Friday, July 09, 2010 3:45 PM
 >> >>>To: Matthew Killeya
 >> >>>Cc: r-help@r-project.org; ba...@stat.wisc.edu
>> >>>Subject: Re: [R] Not nice behaviour of nlminb (windows 32 bit,version
 >> >>>2.11.1)
 >> >>>
 >> >>>On 09/07/2010 10:37 AM, Matthew Killeya wrote:
 >> >>>
>> >>> >>>> nlminb( obj = function(x) x, start=1, lower=-Inf,upper=Inf )
 >> >>>>
 >> >>>>
>> >>>> >>>If you read the PORT documentation carefully,you'll see that their>> >>>convergence criteria are aimed at minimizing positivefunctions. (They
 >> >>>never state this explicitly, as far as I can see.)  So one stopping
>> >>>criterion is that |f(x)|< abs.tol, and that's what it foundfor you. I
 >> >>>don't know if there's a way to turn this off.
 >> >>>
>> >>>Doug or Deepayan, do you know if nlminb can be made to work onfunctions
 >> >>>that go negative?
 >> >>>
 >> >>>Duncan Murdoch
 >> >>>
 >> >>> $par
 >> >>>       >>>>[1] 0
 >> >>>>
 >> >>>>$objective
 >> >>>>[1] 0
 >> >>>>
 >> >>>>$convergence
 >> >>>>[1] 0
 >> >>>>
 >> >>>>$message
 >> >>>>[1] "absolute function convergence (6)"
 >> >>>>
 >> >>>>$iterations
 >> >>>>[1] 1
 >> >>>>
 >> >>>>$evaluations
 >> >>>>function gradient
 >> >>>>       2        2
 >> >>>>
 >> >>>>       [[alternative HTML version deleted]]
 >> >>>>
 >> >>>>
 >> >>>>         >
>> >


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Not nice behaviour of nlminb (windows 32 bit, version 2.11.1)

Reply via email to