Skip to content

Feature: compress_output_ciphertexts in Configuration #1310

@ImamH0ssain

Description

@ImamH0ssain

Use case: Hybrid FHE LLM inference. Output ciphertexts from Server.run() dominate communication cost (~1.6 GB per inference for a 2-layer TinyLlama setup). Input ciphertext compression via compress_input_ciphertexts=True works great, but there is no equivalent for outputs.

Request: Add compress_output_ciphertexts: bool = False to concrete.fhe.Configuration, using the GLWE packing / CompressedCiphertextList mechanism already available in the TFHE-rs backend.

Context: TFHE-rs already supports post-computation compression via CompressedCiphertextList / CompressedCiphertextListBuilder. Exposing this through Concrete's Python API would dramatically reduce server→client bandwidth for FHE-as-a-service workloads.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions