fix: don't downcast large_string to string unnecessarily in concat_str for PyArrow #2176
this would break a plotly test. I don't want to rush this then, leaving it out of tomorrow's release, we'll think about it for the next one
The plotly test could be fixed by making will continue on the issue
return series._from_native_series(concat), offset_left + offset_right

def cast_to_comparable_string_types(
@MarcoGorelli I think I can shorten this and avoid the # type: ignore[arg-type].
Mind if I add a commit?
I managed to get there in fewer characters - but LOC is bound by the name chunked_arrays 😄
(4dfc6a3)
I'll conclude my round of code golf there for today
Just reducing the indent levels a lil bit
schema: The DataFrame schema as Schema or dict of {name: type}. If not
specified, the schema will be inferred by the native library.
This is more of a question than a suggestion.
Is there a preference between these two?
If not provided
and
If not specified
I probably wouldn't have noticed this if the lit(..., dtype=...) doc hadn't shown up in the diff
dtype: The data type of the literal value. If not provided, the data type will
be inferred by the native library.
dunno, don't really mind
thanks Dan!
closes #2097