Now, suppose that I add the additional information that the first k derivatives of f are all non-negative, for some moderate k (maybe 2, 3, or 4; something like that). Are there any worthwhile tricks which could speed things up in this case?
Comments
I'd like to do it all very hands-off, so bisection seemed sensible if unambitious. If there's reason to believe that Newton will be automatically stable (which some other comments hint at), then it's indeed a natural choice.
Don't dismiss the secant method. It's a bit less aggressive than Newton, but it doesn't need a derivative at all, which can make it faster in practice (fewer evaluations of f).
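For concreteness, a minimal sketch of the secant method for this problem (the function name f, the target t, and the starting points are assumptions from the setup above, not anything the commenter specified):

```python
def secant(f, t, x0, x1, tol=1e-12, max_iter=100):
    """Solve f(x) = t by the secant method from two starting points."""
    g0, g1 = f(x0) - t, f(x1) - t
    for _ in range(max_iter):
        if g1 == g0:  # flat chord; stop rather than divide by zero
            break
        # Replace f by the chord through the last two iterates and
        # take the chord's crossing of t as the next iterate.
        x0, x1 = x1, x1 - g1 * (x1 - x0) / (g1 - g0)
        g0, g1 = g1, f(x1) - t
        if abs(g1) < tol:
            break
    return x1
```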
Assume that f is basically a polynomial, so it's not appealing to do something like "call a polynomial solver" as a sub-routine; I am really only interested in something very low-tech and fast. The problem should morally be very easy, which is why I think that such a solution might be available.
In the end, I'm probably interested in solving this equation for many values of t > 0, so in principle one could get a low amortised cost in various ways. I'd be most keen to find a solution which is sensible even for a single solve, though.
My limited intuition is that one could use these constraints to build an aggressive line search. Monotonicity tells you which direction to search, and the bounds on the derivatives can be used to construct a polynomial of appropriate order whose zeros tell you how far to extrapolate.
For higher derivatives, it feels like the trick should extend to higher-order curves than tangent lines, but I've not worked out even this much rigorously enough to be super-confident in it.
I imagine it does, though in the case that I'm considering, checking whether the higher-order Taylor polynomial exceeds t is going to be a similar cost to checking with f. It's a slightly weird request in that way.
I'd guess Newton's method for root-finding might be a good choice here, as the convergence theory relies on bounds on the derivatives, though of course the convergence conditions are pretty non-prescriptive, so you'll just have to try it. You could also combine it with an initial bisection warm start.
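A rough sketch of that combination, assuming f is increasing on the bracket with f(lo) <= t <= f(hi) and that df is its derivative (all names here are illustrative):

```python
def newton_with_warm_start(f, df, t, lo=0.0, hi=1.0,
                           bisect_steps=10, tol=1e-12, max_iter=50):
    """A few bisection steps to localise the root, then Newton."""
    # Bisection warm start: shrink the bracket while keeping
    # f(lo) <= t <= f(hi), relying on monotonicity of f.
    for _ in range(bisect_steps):
        mid = 0.5 * (lo + hi)
        if f(mid) < t:
            lo = mid
        else:
            hi = mid
    # Newton iterations from the bracket midpoint (assumes df > 0 there).
    x = 0.5 * (lo + hi)
    for _ in range(max_iter):
        step = (f(x) - t) / df(x)
        x -= step
        if abs(step) < tol:
            break
    return x
```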
Check out Halley's method, which has a cubic convergence rate. By comparison, Newton's method has quadratic convergence and the bisection method has linear convergence.
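A minimal sketch of Halley's method applied to g(x) = f(x) - t, assuming the first two derivatives df and d2f are available (names are placeholders, not from the post):

```python
def halley(f, df, d2f, t, x, tol=1e-12, max_iter=50):
    """Halley's method for f(x) = t; needs first and second derivatives."""
    for _ in range(max_iter):
        g = f(x) - t
        g1, g2 = df(x), d2f(x)
        # Halley update: x <- x - 2 g g' / (2 g'^2 - g g'').
        step = 2.0 * g * g1 / (2.0 * g1 * g1 - g * g2)
        x -= step
        if abs(step) < tol:
            break
    return x
```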
I recently had to implement Brent's method (for finding the inverse cumulative function of a distribution) and it can sometimes be much faster than the bisection method: https://en.m.wikipedia.org/wiki/Brent%27s_method
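If hand-rolling isn't required, SciPy ships Brent's method as scipy.optimize.brentq. A small runnable example, using a made-up instance of the f described below (the coefficients and k here are purely illustrative):

```python
from scipy.optimize import brentq

# Illustrative instance: f(x) = poly(x) + x^k / (1 - x), increasing on [0, 1).
def f(x, k=3):
    return 1.0 + 2.0 * x + x**2 + x**k / (1.0 - x)

t = 10.0
# brentq needs a sign change of f - t on the bracket; f blows up near 1,
# so an upper endpoint just below 1 suffices.
root = brentq(lambda x: f(x) - t, 0.0, 1.0 - 1e-12)
print(root, f(root))
```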
f(x) = poly(x) + x^k / (1 - x), where the coefficients of poly are positive.
If its second derivative is non-negative, its first derivative is monotonic, and I think it's therefore convex.
So f is lower bounded by its tangent at any sampled point.
If you sample x_0 and f(x_0) < t, rather than bisecting [x_0, 1], bisect [the point where the tangent at x_0 reaches t, 1].
And I was right not to be super-confident... I now think what I suggested gives you the new search area [x_0, new point] rather than [new point, 1]: since the tangent lower-bounds f, f has already reached t by the point where the tangent does, so the root lies to the left of it. Hopefully you get what I was going for 😅
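For what it's worth, a minimal sketch of this tangent-tightened bisection with the correction applied (f, df, and the bracket [0, 1] are assumptions from the setup above):

```python
def tangent_bisect(f, df, t, lo=0.0, hi=1.0, tol=1e-12):
    """Bisection for f(x) = t, with the bracket tightened by tangents.

    Assumes f is increasing and convex on [lo, hi] with f(lo) <= t <= f(hi).
    """
    while hi - lo > tol:
        m = 0.5 * (lo + hi)
        fm = f(m)
        if fm < t:
            lo = m
            d = df(m)
            if d > 0:
                # The tangent at m lower-bounds f (convexity), so f reaches t
                # no later than the tangent does: the tangent's crossing point
                # m + (t - fm)/f'(m) is a valid new right endpoint.
                hi = min(hi, m + (t - fm) / d)
        else:
            hi = m
    return 0.5 * (lo + hi)
```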