Comparing changes

base repository: huggingface/text-generation-inference
base: v3.3.0
head repository: huggingface/text-generation-inference
compare: v3.3.1
  • 12 commits
  • 96 files changed
  • 6 contributors

Commits on May 10, 2025

  1. Change HPU warmup logic: seq length should be with exponential growth (#3217)
    
    Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
    Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
    kaixuanliu and regisss authored May 10, 2025
    c94f415

Commits on May 12, 2025

  1. Adjust the round_up_seq logic in Gaudi backend (#3224)

    Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
    kaixuanliu authored May 12, 2025
    535ce23

Commits on May 15, 2025

  1. Update to Torch 2.7.0 (#3221)

    * Update to Torch 2.7.0
    
    * Try to fix typer/click issue
    
    * Pin click to fix incompatibility with typer
    
    * Fix some test outputs with slight deviations
    
    * Attempt again to sync with CI
    
    * Mamba too
    
    * Fixup mllama
    
    Also switch to `unsloth/Llama-3.2-11B-Vision-Instruct` for testing
    from the EU :).
    danieldk authored May 15, 2025
    7e531f4
  2. Enable Llama4 for Gaudi backend (#3223)

    Signed-off-by: yuanwu <yuan.wu@intel.com>
    yuanwu2017 authored May 15, 2025
    18cbecf

Commits on May 16, 2025

  1. 58934c8

Commits on May 19, 2025

  1. Deepseek R1 for Gaudi backend (#3211)

    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
    sywangyi authored May 19, 2025
    d658b5d

Commits on May 20, 2025

  1. Refine warmup and upgrade to synapse AI 1.21.0 (#3234)

    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
    sywangyi authored May 20, 2025
    000e313
  2. Fix the crash in default ATTENTION path for Gaudi backend (#3235)

    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
    sywangyi authored May 20, 2025
    43b1b07

Commits on May 21, 2025

  1. Switch to punica-sgmv kernel from the Hub (#3236)

    * Switch to punica-sgmv kernel from the Hub
    
    This also switches (temporarily) to the tgi-nix/kernel-builder merge
    branch, bumping up to CUDA 12.8 (same as non-Nix Torch).
    
    * nix: client depends on aiohttp
    
    This probably worked before the nixpkgs bump because a dependency
    propagated aiohttp.
    danieldk authored May 21, 2025
    e325287

Commits on May 22, 2025

  1. Move input_ids to hpu and remove disposal of adapter_meta (#3237)

    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
    sywangyi authored May 22, 2025
    9e7e546
  2. Prepare for 3.3.1 (#3238)

    danieldk authored May 22, 2025
    674c514
  3. Release 3.3.1

    danieldk committed May 22, 2025
    767a652