-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Comparing changes
Open a pull request
base repository: huggingface/text-generation-inference
base: v3.3.2
head repository: huggingface/text-generation-inference
compare: v3.3.3
- 14 commits
- 97 files changed
- 5 contributors
Commits on Jun 3, 2025
-
Remove useless packages (#3253)
Signed-off-by: yuanwu <yuan.wu@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 1ff9d18 - Browse repository at this point
Copy the full SHA 1ff9d18View commit details
Commits on Jun 10, 2025
-
Bump neuron SDK version (#3260)
* chore(neuron): bump version to 0.2.0 * refactor(neuron): use named parameters in inputs helpers This allows to hide the differences between the two backends in terms of input parameters. * refactor(neuron): remove obsolete code paths * fix(neuron): use neuron_config whenever possible * fix(neuron): use new cache import path * fix(neuron): neuron config is not stored in config anymore * fix(nxd): adapt model retrieval to new APIs * fix(generator): emulate greedy in sampling parameters When on-device sampling is enabled, we need to emulate the greedy behaviour using top-k=1, top-p=1, temperature=1. * test(neuron): update models and expectations * feat(neuron): support on-device sampling * fix(neuron): adapt entrypoint * tests(neuron): remove obsolete models * fix(neuron): adjust test expectations for llama on nxd
Configuration menu - View commit details
-
Copy full SHA for 79183d1 - Browse repository at this point
Copy the full SHA 79183d1View commit details
Commits on Jun 11, 2025
-
[gaudi] Perf optimization (#3256)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 8394776 - Browse repository at this point
Copy the full SHA 8394776View commit details
Commits on Jun 12, 2025
-
[gaudi] Vlm rebase and issue fix in benchmark test (#3263)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 613b8dd - Browse repository at this point
Copy the full SHA 613b8ddView commit details -
[gaudi] Move the _update_cos_sin_cache into get_cos_sin (#3254)
Signed-off-by: yuanwu <yuan.wu@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 25fdc5f - Browse repository at this point
Copy the full SHA 25fdc5fView commit details -
[Gaudi] Remove optimum-habana (#3261)
Signed-off-by: yuanwu <yuan.wu@intel.com>
Configuration menu - View commit details
-
Copy full SHA for e07056a - Browse repository at this point
Copy the full SHA e07056aView commit details
Commits on Jun 13, 2025
-
[gaudi] HuggingFaceM4/idefics2-8b issue fix (#3264)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Configuration menu - View commit details
-
Copy full SHA for a220e57 - Browse repository at this point
Copy the full SHA a220e57View commit details -
[Gaudi] Enable Qwen3_moe model (#3244)
Signed-off-by: yuanwu <yuan.wu@intel.com>
Configuration menu - View commit details
-
Copy full SHA for ded4cb5 - Browse repository at this point
Copy the full SHA ded4cb5View commit details -
[Gaudi] Fix the integration-test issues (#3265)
Signed-off-by: yuanwu <yuan.wu@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 3752143 - Browse repository at this point
Copy the full SHA 3752143View commit details
Commits on Jun 17, 2025
-
[Gaudi] use pad_token_id to pad input id (#3268)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 0627983 - Browse repository at this point
Copy the full SHA 0627983View commit details
Commits on Jun 18, 2025
-
Configuration menu - View commit details
-
Copy full SHA for b4d17f1 - Browse repository at this point
Copy the full SHA b4d17f1View commit details -
[gaudi] Refine logging for Gaudi warmup (#3222)
* Refine logging for Gaudi warmup * Make style * Make style 2 * Flash causal LM case * Add log_master & VLM cases * Black
Configuration menu - View commit details
-
Copy full SHA for f13e28c - Browse repository at this point
Copy the full SHA f13e28cView commit details -
Configuration menu - View commit details
-
Copy full SHA for bd1bdeb - Browse repository at this point
Copy the full SHA bd1bdebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1754b79 - Browse repository at this point
Copy the full SHA 1754b79View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v3.3.2...v3.3.3