Error while indexing large datasets like webis-touche using Fast Plaid #180

@meetdoshi90

Error message on torch quantile with FastPLAID
I was trying to evaluate webis-touche using the evaluation code provided by pylate (1.3.4). I am using the FastPLAID index with torch 2.8.0. Any workarounds would be appreciated. Indexing works fine for other, smaller datasets.
Related issues: torch-64947

thread '' panicked at /root/.cache/puccinialin/cargo/registry/src/index.crates.io-1949cf8c6b5b557f/tch-0.20.0/src/wrappers/tensor_generated.rs:14691:57:
called Result::unwrap() on an Err value: Torch("quantile() input tensor is too large\nException raised from quantile_compute at /pytorch/aten/src/ATen/native/Sorting.cpp:290 (most recent call first):\nframe #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits, std::allocator >) + 0x80 (0x1486a5c18eb0 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libc10.so)\nframe #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) + 0x69 (0x1486a5bb5b5f in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libc10.so)\nframe #2: + 0x1c8da77 (0x1486e35bea77 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #3: + 0x1c8ed4b (0x1486e35bfd4b in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #4: at::native::quantile(at::Tensor const&, at::Tensor const&, std::optional, bool, std::basic_string_view<char, std::char_traits >) + 0x57 (0x1486e35c02b7 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #5: + 0x2be537d (0x1486e451637d in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #6: at::_ops::quantile::call(at::Tensor const&, at::Tensor const&, std::optional, bool, std::basic_string_view<char, std::char_traits >) + 0x228 (0x1486e3d3cc78 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #7: + 0x1e6924 (0x148376870924 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #8: + 0x161e1f (0x1483767ebe1f in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #9: + 0x191486 (0x14837681b486 in 
/home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #10: + 0x191f52 (0x14837681bf52 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #11: + 0x190050 (0x14837681a050 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #12: + 0x191860 (0x14837681b860 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\n\nframe #28: + 0x295d0 (0x1487215b05d0 in /lib64/libc.so.6)\nframe #29: __libc_start_main + 0x80 (0x1487215b0680 in /lib64/libc.so.6)\n")
note: run with RUST_BACKTRACE=1 environment variable to display a backtrace
Tasks: 0%| | 0/5 [1:38:14<?, ?it/s]
Traceback (most recent call last):
  File "/home/embedding/meet/TableRetrieval/Table-RAG/finetuning/granite-embeddings/scripts/landmark_index_colbert.py", line 519, in
    index.add_documents(
  File "/home/meet/new_conda/anaconda3/lib/python3.12/site-packages/pylate/indexes/plaid.py", line 217, in add_documents
    return self._index.add_documents(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/meet/new_conda/anaconda3/lib/python3.12/site-packages/pylate/indexes/fast_plaid.py", line 189, in add_documents
    self.fast_plaid.create(
  File "/home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/search/fast_plaid.py", line 283, in create
    fast_plaid_rust.create(
pyo3_runtime.PanicException: called Result::unwrap() on an Err value: Torch("quantile() input tensor is too large\nException raised from quantile_compute at /pytorch/aten/src/ATen/native/Sorting.cpp:290 (most recent call first):\nframe #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits, std::allocator >) + 0x80 (0x1486a5c18eb0 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libc10.so)\nframe #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) + 0x69 (0x1486a5bb5b5f in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libc10.so)\nframe #2: + 0x1c8da77 (0x1486e35bea77 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #3: + 0x1c8ed4b (0x1486e35bfd4b in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #4: at::native::quantile(at::Tensor const&, at::Tensor const&, std::optional, bool, std::basic_string_view<char, std::char_traits >) + 0x57 (0x1486e35c02b7 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #5: + 0x2be537d (0x1486e451637d in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #6: at::_ops::quantile::call(at::Tensor const&, at::Tensor const&, std::optional, bool, std::basic_string_view<char, std::char_traits >) + 0x228 (0x1486e3d3cc78 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/torch/lib/libtorch_cpu.so)\nframe #7: + 0x1e6924 (0x148376870924 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #8: + 0x161e1f (0x1483767ebe1f in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #9: + 0x191486 (0x14837681b486 in 
/home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #10: + 0x191f52 (0x14837681bf52 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #11: + 0x190050 (0x14837681a050 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\nframe #12: + 0x191860 (0x14837681b860 in /home/meet/new_conda/anaconda3/lib/python3.12/site-packages/fast_plaid/fast_plaid_rust.cpython-312-x86_64-linux-gnu.so)\n\nframe #28: + 0x295d0 (0x1487215b05d0 in /lib64/libc.so.6)\nframe #29: __libc_start_main + 0x80 (0x1487215b0680 in /lib64/libc.so.6)\n")
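
For context on what a workaround might look like: frames #6 and #7 above show `quantile` being called from inside `fast_plaid_rust`, so this is probably not something that can be patched from the Python side alone. Below is a minimal sketch of how the same linearly interpolated quantiles could be computed with `torch.sort` instead of `torch.quantile`. The function name `quantile_via_sort` is hypothetical, and the size limit mentioned in the comment (about 2**24 elements) is my assumption about the check in `Sorting.cpp`, not something confirmed in this trace.

```python
import torch


def quantile_via_sort(values: torch.Tensor, q: torch.Tensor) -> torch.Tensor:
    """Linearly interpolated quantiles of all elements of `values`.

    Hypothetical helper: it avoids torch.quantile, which (assumption) rejects
    inputs larger than roughly 2**24 elements, by sorting explicitly.
    """
    sorted_vals, _ = torch.sort(values.flatten())
    n = sorted_vals.numel()

    # Fractional rank of each quantile in the sorted array. float64 is used
    # because float32 cannot represent every index once n exceeds 2**24.
    pos = q.to(torch.float64) * (n - 1)
    lo = pos.floor().long()
    hi = pos.ceil().long()

    # Interpolate between the two neighbouring order statistics.
    frac = (pos - lo).to(sorted_vals.dtype)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * frac


if __name__ == "__main__":
    torch.manual_seed(0)
    x = torch.randn(1000)
    q = torch.tensor([0.1, 0.5, 0.9])
    # On a small input both paths should agree up to rounding.
    print(quantile_via_sort(x, q))
    print(torch.quantile(x, q))
```

Sorting the full tensor costs more memory than a selection-based approach, so computing the quantiles on a random subsample of the values may be the cheaper option if approximate bucket boundaries are acceptable.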

Labels: bug (Something isn't working)
