docs: Clarify arrow usage (#16152)

pola-rs · May 10, 2024 · d341156 · d341156
1 parent b38aa4b
commit d341156
Show file tree

Hide file tree

Showing 3 changed files with 4 additions and 4 deletions.
diff --git a/docs/user-guide/ecosystem.md b/docs/user-guide/ecosystem.md
@@ -2,7 +2,7 @@
 
 ## Introduction
 
-On this page you can find a non-exhaustive list of libraries and tools that support Polars. As the data ecosystem is evolving fast, more libraries will likely support Polars in the future. One of the main drivers is that Polars makes use of `Apache Arrow` in it's backend.
+On this page you can find a non-exhaustive list of libraries and tools that support Polars. As the data ecosystem is evolving fast, more libraries will likely support Polars in the future. One of the main drivers is that Polars makes adheres its memory layout to the `Apache Arrow` spec.
 
 ### Table of contents:
 

diff --git a/docs/user-guide/expressions/missing-data.md b/docs/user-guide/expressions/missing-data.md
@@ -4,7 +4,7 @@ This page sets out how missing data is represented in Polars and how missing dat
 
 ## `null` and `NaN` values
 
-Each column in a `DataFrame` (or equivalently a `Series`) is an Arrow array or a collection of Arrow arrays [based on the Apache Arrow format](https://arrow.apache.org/docs/format/Columnar.html#null-count). Missing data is represented in Arrow and Polars with a `null` value. This `null` missing value applies for all data types including numerical values.
+Each column in a `DataFrame` (or equivalently a `Series`) is an Arrow array or a collection of Arrow arrays [based on the Apache Arrow spec](https://arrow.apache.org/docs/format/Columnar.html#null-count). Missing data is represented in Arrow and Polars with a `null` value. This `null` missing value applies for all data types including numerical values.
 
 Polars also allows `NotaNumber` or `NaN` values for float columns. These `NaN` values are considered to be a type of floating point data rather than missing data. We discuss `NaN` values separately below.
 

diff --git a/docs/user-guide/migration/pandas.md b/docs/user-guide/migration/pandas.md
@@ -23,9 +23,9 @@ more explicit, more readable and less error-prone.
 
 Note that an 'index' data structure as known in databases will be used by Polars as an optimization technique.
 
-### Polars uses Apache Arrow arrays to represent data in memory while pandas uses NumPy arrays
+### Polars adheres to the Apache Arrow memory format to represent data in memory while pandas uses NumPy arrays
 
-Polars represents data in memory with Arrow arrays while pandas represents data in
+Polars represents data in memory according to the Arrow memory spec while pandas represents data in
 memory with NumPy arrays. Apache Arrow is an emerging standard for in-memory columnar
 analytics that can accelerate data load times, reduce memory usage and accelerate
 calculations.