Refactor: use Apache Arrow compute for string function #885
maartenbreddels merged 3 commits into master
Force-pushed from adf48db to da0a232.
FYI: If you have a bit of patience (like 24h of patience), you could use the Arrow conda packages in the arrow-nightlies channel instead of building it yourself.
Force-pushed from da0a232 to 229f9ce.
Great, I didn't know it existed, and it was difficult to find. Thanks a lot!
Force-pushed from 820f61c to d55a96a.
@JovanVeljanoski It would be great if you could add/finish the str->boolean functions added in apache/arrow#7656.
@JovanVeljanoski I think I want to merge this early and leave the rest for you to do in a different PR. We need some of this in #865, and I also want to merge that soon.
Force-pushed from dbc8a7b to 0f89f54.
_doc_snippets['chunk_size_export'] = 'Number of rows to be written to disk in a single iteration'
_doc_snippets['evaluate_parallel'] = 'Evaluate the (virtual) columns in parallel'
_doc_snippets['array_type'] = 'Type of output array, possible values are None/"numpy" (ndarray), "xarray" for a xarray.DataArray, or "list" for a Python list'
_doc_snippets['ascii'] = 'Transform only ascii character (usually faster).'
character -> characters
Force-pushed from 0f89f54 to 62e8302.
Force-pushed from 025988f to 2a85f7c.
Force-pushed from e24fc95 to 5ed5b5b.
Force-pushed from 5e50d73 to 394e70a.
Windows CI has become crazy slow btw; we'll have to trace back when/why that happened. It seems the conda env creation takes ages.
This is a draft PR to check the status of Arrow compute with vaex. I think we will likely cherry-pick from this branch as Arrow makes new releases.