Assert quant_min/quant_max in embedding4bit #28590
Triggered via pull request
December 20, 2024 18:57
Status
Success
Total duration
1h 7m 2s
Artifacts
–
pull.yml
on: pull_request
Matrix: test-llama-runner-linux
gather-models
5s
unittest
/
...
/
linux-job
19m 3s
unittest
/
...
/
macos-job
11m 19s
unittest-arm
/
linux-job
18m 33s
test-binary-size-linux
/
job
test-binary-size-linux-gcc
/
job
test-custom-ops-linux
/
job
test-eval_llama-mmlu-linux
/
job
test-eval_llama-wikitext-linux
/
job
test-llama-runner-linux-android
/
job
test-llama_runner_eager-linux
/
job
test-llava-runner-linux
/
job
test-mediatek-models-linux
/
job
test-phi-3-mini-runner-linux
/
job
test-pybind-build-linux
/
job
test-quantized-aot-lib-linux
/
job
test-selective-build-linux
/
job
test-setup-linux-gcc
/
job
Matrix: test-llama-runner-qnn-linux
Matrix: test-models-linux
android
/
run-emulator
2m 39s