Skip to content

Actions: ggml-org/llama.cpp

Actions

Server

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
11,833 workflow runs
11,833 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

json : support enum values within allOf
Server #18611: Pull request #15830 synchronize by aldehir
September 6, 2025 22:43 18m 32s aldehir:fix-allOf-enum
September 6, 2025 22:43 18m 32s
CUDA: faster tile FA (Pascal/AMD), headsize 256 (#15769)
Server #18610: Commit 79bc429 pushed by JohannesGaessler
September 6, 2025 22:26 19m 6s master
September 6, 2025 22:26 19m 6s
kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16…
Server #18606: Commit c4df49a pushed by taronaeo
September 6, 2025 14:08 13m 54s master
September 6, 2025 14:08 13m 54s
llguidance : use attrs to determine special tokens
Server #18605: Pull request #15837 synchronize by dstoc
September 6, 2025 13:45 26m 29s dstoc:llguidance-special-tokens
September 6, 2025 13:45 26m 29s
llguidance : use attrs to determine special tokens
Server #18604: Pull request #15837 opened by dstoc
September 6, 2025 13:35 Action required dstoc:llguidance-special-tokens
September 6, 2025 13:35 Action required
server : speed up tests (#15836)
Server #18603: Commit 3c3635d pushed by ngxson
September 6, 2025 12:45 29m 9s master
September 6, 2025 12:45 29m 9s
webgpu : fix build on emscripten
Server #18602: Pull request #15826 synchronize by ngxson
September 6, 2025 12:26 36m 28s ngxson:xsn/emscripten_webgpu
September 6, 2025 12:26 36m 28s
imatrix: calculate activation-based statistics for new format (GGUF) imatrices
Server #18600: Pull request #14891 synchronize by EAddario
September 6, 2025 12:09 13m 34s EAddario:imatrix
September 6, 2025 12:09 13m 34s
server : speed up tests
Server #18599: Pull request #15836 synchronize by ngxson
September 6, 2025 11:51 20m 53s xsn/server_speedup_test
September 6, 2025 11:51 20m 53s
server : speed up tests
Server #18598: Pull request #15836 synchronize by ngxson
September 6, 2025 11:38 13m 7s xsn/server_speedup_test
September 6, 2025 11:38 13m 7s
server : implement prompt processing progress report in stream mode (…
Server #18597: Commit 61bdfd5 pushed by ngxson
September 6, 2025 11:35 33m 32s master
September 6, 2025 11:35 33m 32s
server : speed up tests
Server #18596: Pull request #15836 synchronize by ngxson
September 6, 2025 11:34 4m 53s xsn/server_speedup_test
September 6, 2025 11:34 4m 53s
server : speed up tests
Server #18595: Pull request #15836 opened by ngxson
September 6, 2025 11:30 4m 24s xsn/server_speedup_test
September 6, 2025 11:30 4m 24s
server : implement prompt processing progress report in stream mode
Server #18592: Pull request #15827 synchronize by ngxson
September 6, 2025 10:43 13m 37s xsn/server_progress_api
September 6, 2025 10:43 13m 37s
server : implement prompt processing progress report in stream mode
Server #18591: Pull request #15827 synchronize by ngxson
September 6, 2025 10:36 7m 12s xsn/server_progress_api
September 6, 2025 10:36 7m 12s
server : implement prompt processing progress report in stream mode
Server #18590: Pull request #15827 synchronize by ngxson
September 6, 2025 10:23 13m 30s xsn/server_progress_api
September 6, 2025 10:23 13m 30s
vulkan: add mul_mat variant for embedded gpus
Server #18589: Pull request #15800 synchronize by rmatif
September 6, 2025 09:59 12m 37s rmatif:vk-mulmat-embed
September 6, 2025 09:59 12m 37s
metal : make the backend async
Server #18588: Pull request #15832 opened by ggerganov
September 6, 2025 09:28 14m 34s gg/metal-async
September 6, 2025 09:28 14m 34s
json : support enum values within allOf
Server #18587: Pull request #15830 opened by aldehir
September 6, 2025 07:27 14m 0s aldehir:fix-allOf-enum
September 6, 2025 07:27 14m 0s