Why single precision by default in Haversine? #216

juliohm · 2021-04-23T13:08:23Z

Line 14 in 369f586

Haversine() = Haversine(Float32(6_371_000))

Most lat/lon datasets are double precision so I don't understand why the single precision here. Can we double it?

nalimilan · 2021-04-26T20:43:28Z

See #176 (comment). The idea was that if you have Float64 inputs, the result with be promoted to Float64, but if you have lower precision inputs they will keep using Float32. What's your use case?

juliohm · 2021-04-28T19:30:46Z

lat/lon values are stored as Float64 in most GIS datasets. Computing anything in single precision is pretty rare because lat/lon values differ in the very last digits of the double precision representation. That is why I got confused with Float32.

This internal design decision could be made explicit in the docstring so that end users coming from GIS won't question it again. I did a quick benchmark to see if the promotion from single to double precision changed the performance, but apparently it doesn't.

juliohm · 2021-04-28T19:33:06Z

If we talk about the most common use case of the Haversine, it will be in GIS applications. And in this case I would just set the default to Float64 to avoid confusion. People interested in Haversine for other exotic uses like GPU for example, could explicitly provide other type parameter.

nalimilan · 2021-05-02T20:52:16Z

Do you have a concrete example where this could be confusing?

@mkborregaard @dkarrasch What do you think?

juliohm · 2021-05-02T21:24:03Z

Just instantiating the distance will show the type parameter in the REPL and this is confusing for someone expecting double precision by default.

dkarrasch · 2021-05-03T14:06:44Z

I believe that having a "minimal" type here is (at the risk of human confusion) the better choice. There is no lack of precision for that number, and for higher-precision floats it will be promoted anyway. To me personally, it feels a bit like the I in LinearAlgebra, whose default type is Bool for the same reason. It would be a different story if performance was suffering, though. Then we should perhaps take care of the most common use case.

juliohm · 2021-05-12T11:53:22Z

Thank you, I think it makes sense given that there is no performance penalty.

Perhaps a note in the docstring would be enough.

nalimilan · 2021-09-04T14:59:52Z

See #226 (comment): it turns out we can just use an Int value as the default.

mkborregaard · 2021-09-06T08:03:13Z

great!

juliohm closed this as completed May 12, 2021

nalimilan mentioned this issue Sep 4, 2021

Small improvements to Haversine #226

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why single precision by default in Haversine? #216

Why single precision by default in Haversine? #216

juliohm commented Apr 23, 2021 •

edited

Loading

nalimilan commented Apr 26, 2021

juliohm commented Apr 28, 2021

juliohm commented Apr 28, 2021

nalimilan commented May 2, 2021

juliohm commented May 2, 2021

dkarrasch commented May 3, 2021

juliohm commented May 12, 2021

nalimilan commented Sep 4, 2021

mkborregaard commented Sep 6, 2021

Why single precision by default in Haversine? #216

Why single precision by default in Haversine? #216

Comments

juliohm commented Apr 23, 2021 • edited Loading

nalimilan commented Apr 26, 2021

juliohm commented Apr 28, 2021

juliohm commented Apr 28, 2021

nalimilan commented May 2, 2021

juliohm commented May 2, 2021

dkarrasch commented May 3, 2021

juliohm commented May 12, 2021

nalimilan commented Sep 4, 2021

mkborregaard commented Sep 6, 2021

juliohm commented Apr 23, 2021 •

edited

Loading