Skip to main content

v0.9.0 - 2026-01-02

New features
  • GGUF is now the default inference engine for model bundling, generating .gguf files for llama.cpp inference.
  • Add --executorch flag to use ExecuTorch bundling instead of GGUF. ExecuTorch inference is deprecated and may be removed in a future version.
  • Add --mmproj-quantization option for GGUF bundling of vision-language and audio models.
  • Support downloading multiple .gguf files for GGUF bundle requests.

v0.8.0 - 2025-12-16

Improvements
  • Update GGUF model download interface to take model name and quantization rather than manifest URL directly.

v0.7.0 - 2025-11-19

New features
  • Download GGUF model files from JSON manifest
  • leap-bundle download <json-manifest-url> [--output-path <output-path>] [--overwrite]
  • Bundle request download will throw error if the file already exists in the output path.
  • Use --overwrite option to allow overwriting the existing file.
Bug fixes
  • Show correct filenames when uploading files for bundle creation

v0.6.0 - 2025-11-05

New features
  • Support latest LFM2-VL model.
Improvements
  • Move model type validation to server side for faster new model support in the future.

v0.5.1 - 2025-10-06

Bug fixes
  • Add validation to reject Liquid audio model submissions.
  • Liquid audio models are not supported by the bundle service yet.
  • They will be in a future release.

v0.5.0 - 2025-09-24

New features
  • Add --quantization option for the create command to specify the quantization type.
  • Supported quantization types: 8da4w_output_8da8w, 8da8w_output_8da8w.
  • Default quantization: 8da4w_output_8da8w.

v0.4.0 - 2025-09-16

New features
  • Add resume command to resume an interrupted or failed upload.
Improvements
  • Add parallel upload support for faster upload speed.
  • Users can choose between --parallel and --sequential upload modes for create and resume commands.
  • By default, --parallel upload is used. If it fails, upload will fall back to --sequential mode.
  • Add --json option to create, resume, and list commands to output the final results in JSON for easy parsing.
  • Add --last option to list command to show only the last request.
  • Show user-friendly information when upload fails.
Fixes
  • Fix --output-path option for download command.
  • Previously this option was ignored and the bundle file would always be downloaded to the current directory.

v0.3.0 - 2025-08-26

New features
  • Support lfm2-vl models.
Improvements
  • Show user-friendly information and error messages.
  • Fix minor bugs.

v0.2.0 - 2025-08-12

New features
  • Add validate command.
Improvements
  • Add Notes columns in request table.
  • Validate input model path before creating a request.
  • Update and add processing error message.
  • Show download URL and link to doc page.

v0.1.1 - 2025-07-30

Improvements
  • Test and report network connectivity before model upload.

v0.1.0 - 2025-07-25

Initial release of the leap-bundle package.