
Optimize normalizeLongitude method #373

Open
zongweil opened this issue Jan 7, 2020 · 16 comments · Fixed by #460

Comments

@zongweil
Contributor

zongweil commented Jan 7, 2020

Currently, our normalizeLongitude() methods use a while loop to bring the longitude into an acceptable range. However, this can take a very long time if the integrating application passes in an extremely large longitude value. We can resolve this, while maintaining the same behavior, by computing the normalized longitude in a single step. Something like:

if longitude < -180
    divisor = (int) longitude / -360
    longitude = longitude + ((divisor + 1) * 360)
else if longitude > 180
    divisor = (int) longitude / 360
    longitude = longitude - ((divisor + 1) * 360)
if longitude == 180
    longitude = -180
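
For illustration, a direct Go transcription could look like the sketch below (just a sketch, not an implementation from any of the repos; it uses math.Trunc instead of an int cast so that inputs too large for an integer type stay in float64):

// normalizeLongitude folds an arbitrary longitude into [-180, 180)
// in a single step, so the cost is independent of input magnitude
func normalizeLongitude(longitude float64) float64 {
  if longitude < -180 {
    divisor := math.Trunc(longitude / -360)
    longitude += (divisor + 1) * 360
  } else if longitude > 180 {
    divisor := math.Trunc(longitude / 360)
    longitude -= (divisor + 1) * 360
  }
  if longitude == 180 {
    longitude = -180
  }
  return longitude
}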

This is the master issue for tracking the work to be done across the various implementations. See #370 for more context.

@kpym

kpym commented Jan 14, 2020

You don't need different cases; you can usually use something like mod(longitude + 180, 360) - 180 (a Go sketch follows below). This code, like yours, has two drawbacks:

  • the code will be quite different between languages, since mod semantics for negative operands vary,
  • this code is slower when the longitude is already close to normalized, which is, I think, the most common case.
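
In Go, for example, math.Mod keeps the sign of its first argument, so a sketch of that one-liner needs a small fix-up for negative inputs:

// fold longitude into [-180, 180) using floating-point mod;
// math.Mod returns a result with the dividend's sign, so
// negative results are shifted back into range first
func normalizeLongitude(longitude float64) float64 {
  m := math.Mod(longitude+180, 360)
  if m < 0 {
    m += 360
  }
  return m - 180
}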

@bocops
Contributor

bocops commented Apr 21, 2021

If this is still an open issue, I suggest creating a project for it so that we can track this across all language versions.

@fulldecent
Contributor

fulldecent commented Apr 25, 2021

Recommended tag: implementation // specification

@fulldecent
Contributor

I am considering adding to the OLC specification that longitude normalization MUST run in O(1) time.

I consider this a security issue.

@fulldecent
Contributor

Related #370

@fulldecent
Contributor

Added test cases at #444 for vulnerable behavior.

Related: #241

@bocops
Contributor

bocops commented Nov 15, 2021

There are still several language versions that normalize via while loop - for example c, cpp, python - so I'd suggest keeping this open and properly listing all the language versions that still need an update, as already requested above.

@kpym

kpym commented Nov 15, 2021

IMO, normalizing with loops is optimal, especially for longitudes close to the normal form. In any case, making this kind of change without benchmarking (against day-to-day cases) is not a good idea, I think.

@fulldecent
Contributor

Benchmarking is not necessary.

Using an input of 100e100 creates a DoS security vulnerability. This approach fixes it.

Using an extra MOD function increases computation cost by one arithmetic operation. We cannot reasonably expect any platform we are targeting to be harmed by a single additional instruction.

@kpym

kpym commented Nov 16, 2021

@fulldecent I see, I wasn't thinking about malicious use but about performance in "standard" cases. Maybe we should keep the best performance in reasonable cases and prevent malicious use by switching to a bounded-time algorithm only when the value is too large?

@bocops
Contributor

bocops commented Nov 16, 2021

Quick back-of-the-envelope calculation:

The old algorithm has one comparison and one addition/subtraction per loop iteration, plus another comparison for the loop it doesn't need to take. That's ~2x+1 operations, with x being the number of "full circles" the non-normalized longitude is off by.

The new algorithm has two comparisons, plus, if necessary, two additions/subtractions and two modulo operations. That's a fixed cost of 6 operations in the worst case.

Looping is strictly worse if x > 2, but adding x = 1, 2 as special cases to the new algorithm would mean we'd still loop once or twice (2*2+1 = 5) after performing four comparisons - two to drop out early, two to loop instead of calculating directly. That means we'd "optimize" a 6-op algorithm by making it a 7- or 9-op algorithm.

@kpym

kpym commented Nov 16, 2021

I benchmarked two versions of normalize in Go:

// the 'loop' version
func normalize(value, max float64) float64 {
  for value < -max {
    value += 2 * max
  }
  for value >= max {
    value -= 2 * max
  }
  return value
}

// the 'mod' version
func normalize2(value, max float64) float64 {
  if value >= -max && value < max {
    return value
  }
  // math.Mod keeps the sign of its first argument, so shift
  // negative results back into [0, 2*max) before recentering
  value = math.Mod(value+max, 2*max)
  if value < 0 {
    value += 2 * max
  }
  return value - max
}

The normalize version is around 4 times faster (3ns vs 14ns) up to value=2000 and continues to be faster up to value=20000. So if we do not want to lose 400% of speed for "normal" values, but still want to bound the normalization time, we can combine both methods to obtain:

// threshold for the normalization method
const toobig float64 = 2e4

func normalize(value, max float64) float64 {
  // value's period
  period := 2 * max
  // if the value is too big, use modulo (constant time);
  // math.Mod keeps the dividend's sign, so fix up negatives
  if value < -toobig || value > toobig {
    value = math.Mod(value+max, period)
    if value < 0 {
      value += period
    }
    return value - max
  }
  // else use only additions to normalize
  for value < -max {
    value += period
  }
  for value >= max {
    value -= period
  }
  return value
}
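
For reference, a harness along these lines (hypothetical, not the exact one used for the numbers above) reproduces the comparison with Go's built-in benchmarking, run via go test -bench=. :

// normalize_test.go - compares the 'loop' and 'mod' versions above
package olc

import "testing"

var sink float64 // prevents the compiler from eliding the calls

func BenchmarkNormalizeLoop(b *testing.B) {
  for i := 0; i < b.N; i++ {
    sink = normalize(2000, 180)
  }
}

func BenchmarkNormalizeMod(b *testing.B) {
  for i := 0; i < b.N; i++ {
    sink = normalize2(2000, 180)
  }
}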

@fulldecent
Contributor

Holding your phone closer to your face will make the calculation appear faster because the photons have less distance to travel to your eyes.

That is roughly the magnitude of the speed difference between these two algorithms in normal operating cases.

In extreme cases (the attack scenario) one algorithm will execute "immediately" and the other will take longer on modern hardware than the lifespan of the Sun.
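
In fact, for a float64 input of that size the looping version never terminates at all: the gap between adjacent representable float64 values near 100e100 is far larger than 360, so the subtraction is a no-op. A quick standalone check:

package main

import "fmt"

func main() {
  value := 100e100
  // at this magnitude the spacing between representable
  // float64 values (~1e86) dwarfs 360, so subtracting it
  // changes nothing and a while-loop would spin forever
  fmt.Println(value-360 == value) // prints true
}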

@bocops
Contributor

bocops commented Nov 16, 2021

I'm not seeing the same when normalizing an array of 1M random entries using the original "non-optimized, looping" algorithm vs. the "non-looping" algorithm in Kotlin/JVM.

If entry values are at least somewhat reasonable (tested with +/- 500, 1000, 10000), looping over the array with either algorithm finishes in about the same time, with negligible difference. If entry values are decidedly not reasonable (tested with +/- 5000000), the looping algorithm takes 50x as long, which is exactly the DoS-style behaviour that prompted this whole issue.

At this point, I wonder what exactly we're optimizing for. If it is individual Plus Code encodings, even a 400% increase might not be noticed by anyone when we're talking about microseconds, so avoiding DoS and keeping the code readable should be the priority. If it is encoding millions of entries we're concerned with, it might be better to sanitize those entries before encoding them.

For what it's worth, consider that applications such as Google Maps simply refuse to deal with non-normalized coordinates as input, so inputs like that are not really a concern in practice.

@fulldecent
Contributor

This is not an optimization. This is a fix for a specific problem.

Anybody can optimize the time it takes to get in their house by removing the door.


Converting 1M Plus Codes is not a normal use case. That is a batch operation, run non-interactively on a server.

If it were run interactively, then again the server would have more resources and parallel task execution, and again the difference would be on the order of the time a photon takes to travel between a phone and an eyeball.

@sonyaa
Contributor

sonyaa commented Nov 18, 2021

Reopening to track fixes for other languages.

@sonyaa sonyaa reopened this Nov 18, 2021