## New Features - [ ] https://github.com/TransferQueue/TransferQueue/issues/85 - [x] Implement `check_data_consumption_status` & `check_data_production_status` for `TransferQueueController` https://github.com/TransferQueue/TransferQueue/pull/157 ### Backend Related - [x] Support Ray RDT backend https://github.com/TransferQueue/TransferQueue/pull/167 - [x] Support MoonCakeStore backend https://github.com/TransferQueue/TransferQueue/pull/162 ## Adaption Works - [ ] Support verl multi-modal scenario https://github.com/TransferQueue/verl/tree/agentloop_tq - [ ] Merge with verl main code entry - [ ] Support verl router replay https://github.com/TransferQueue/verl/tree/new_router_replay ## Performance - [x] Optimize data spliting & gathering opterations (`get_transfer_data`, `StorageUnitData.put_data/get_data`, etc.) @jianjunzhong https://github.com/TransferQueue/TransferQueue/pull/151 - [ ] Optimize the socket event loop by splitting into to multiple threads, one for each type of request - [ ] Strengthen `TransferQueueController` concurrency - [ ] Improve zero-copy performance https://github.com/TransferQueue/TransferQueue/pull/163 https://github.com/TransferQueue/TransferQueue/pull/121 ## User Interface - [ ] Unified environmental variable management ## Debug - [x] Performance metric https://github.com/TransferQueue/TransferQueue/pull/132 - [ ] Command line debug interface - [x] Cleanup redundant logs https://github.com/TransferQueue/TransferQueue/pull/160 https://github.com/TransferQueue/TransferQueue/pull/159 https://github.com/TransferQueue/TransferQueue/pull/132 - [x] Better metadata print https://github.com/TransferQueue/TransferQueue/pull/133 ## Minor Issues & Code Clean - [ ] Provide a general dynamic socket function for both Client & Storage - [ ] Make dtypes & shapes compulsory for `update_production_status` - [ ] Optimize `_merge_tensors_to_tensordict` to serve as a general function since similar codes may scatter everywhere inside TQ - [x] Split the zero copy optimization from zmq_utils.py to serial_utils.py providing a general serialization util for tensordict (both device-side and cpu-side) https://github.com/TransferQueue/TransferQueue/pull/140 ## Documentary - [ ] Build doc web site - [x] Provide tutorials https://github.com/TransferQueue/TransferQueue/pull/141
New Features
check_data_consumption_status&check_data_production_statusforTransferQueueController[Feat]: add check_data_production_status and check_consumption_status and support Polling get metadata TransferQueue/TransferQueue#157Backend Related
Adaption Works
Performance
get_transfer_data,StorageUnitData.put_data/get_data, etc.) @jianjunzhong [Refactor] Simplify the put/get logic of AsyncSimpleStorageManager TransferQueue/TransferQueue#151TransferQueueControllerconcurrencyUser Interface
Debug
Minor Issues & Code Clean
update_production_status_merge_tensors_to_tensordictto serve as a general function since similar codes may scatter everywhere inside TQDocumentary