TY - JOUR
T1 - Spatial Data Science Languages: commonalities and needs
AU - Pebesma, Edzer
AU - Fleischmann, Martin
AU - Parry, Josiah
AU - Nowosad, Jakub
AU - Graser, Anita
AU - Dunnington, Dewey
AU - Pronk, Maarten
AU - Schouten, Rafael
AU - Lovelace, Robin
AU - Appel, Marius
AU - Abad, Lorena
PY - 2025/3/20
Y1 - 2025/3/20
N2 - Recent workshops brought together several developers, educators and users of software packages extending popular languages for spatial data handling, with a primary focus on R, Python and Julia. Common challenges discussed included handling of spatial or spatio-temporal support, geodetic coordinates, in-memory vector data formats, data cubes, inter-package dependencies, packaging upstream libraries, differences in habits or conventions between the GIS and physical modelling communities, and statistical models. The following set of insights have been formulated: (i) considering software problems across data science language silos helps to understand and standardise analysis approaches, also outside the domain of formal standardisation bodies; (ii) whether attribute variables have block or point support, and whether they are spatially intensive or extensive has consequences for permitted operations, and hence for software implementing those; (iii) handling geometries on the sphere rather than on the flat plane requires modifications to the logic of {\em simple features}, (iv) managing communities and fostering diversity is a necessary, on-going effort, and (v) tools for cross-language development need more attention and support.
AB - Recent workshops brought together several developers, educators and users of software packages extending popular languages for spatial data handling, with a primary focus on R, Python and Julia. Common challenges discussed included handling of spatial or spatio-temporal support, geodetic coordinates, in-memory vector data formats, data cubes, inter-package dependencies, packaging upstream libraries, differences in habits or conventions between the GIS and physical modelling communities, and statistical models. The following set of insights have been formulated: (i) considering software problems across data science language silos helps to understand and standardise analysis approaches, also outside the domain of formal standardisation bodies; (ii) whether attribute variables have block or point support, and whether they are spatially intensive or extensive has consequences for permitted operations, and hence for software implementing those; (iii) handling geometries on the sphere rather than on the flat plane requires modifications to the logic of {\em simple features}, (iv) managing communities and fostering diversity is a necessary, on-going effort, and (v) tools for cross-language development need more attention and support.
KW - spatio-temporal
KW - Spatial data science
KW - movement data analysis
U2 - 10.48550/arXiv.2503.16686
DO - 10.48550/arXiv.2503.16686
M3 - Article
JO - arXiv
JF - arXiv
ER -