Ktransformers

completed

Ai Ml

monorepo / python · medium

1,143

Files

221,774

LOC

Frameworks

Languages

Overview Files & Metrics Git Activity Call Graph Security Reports

Pipeline State

completed

Run ID

#307646

Phase

done

Progress

Started

Finished

2026-04-13 01:31:02

LLM tokens

Previous runs

Repobility · code-quality scanner for AI-generated software · https://repobility.com
#	Status	Phase	Started	Finished
Repobility · severity-and-effort ranking · https://repobility.com
#31542	failed	SYMBOL_EXTRACTION	2026-03-07 03:00:05	—

Pipeline Metadata

Stage

Cataloged

Decision

proceed

Novelty

88.00

Framework unique

—

Isolation

—

Last stage change

2026-05-10 03:35:02

Deduplication group #60534

Member of a group with 1 similar repo(s) — this repo is canonical view group →

Top concepts (2)

Project DescriptionWeb Backend

About: code-quality intelligence by Repobility · https://repobility.com

🧪 Code Distillation

AI Prompt

Create a research framework, KTransformers, for efficiently handling large language model inference and fine-tuning using CPU-GPU heterogeneous computing. The system should feature two core modules: `kt-kernel` for high-performance inference and `kt-sft` for fine-tuning. I need support for various models like MiniMax, GLM, and Kimi, and the ability to handle advanced features such as CPU-GPU expert scheduling and native BF16/FP8 precision. Please structure the project using Python, FastAPI, and Vue.js components where appropriate for documentation or UI elements.

python fastapi vue.js research llm inference fine-tuning cpu-gpu heterogeneous-computing

Generated by gemma4:latest

Catalog Information

KTransformers is a research project focused on efficient inference and fine-tuning of large language models through CPU-GPU heterogeneous computing.

Description

KTransformers is a flexible framework for experiencing cutting-edge LLM inference/fine-tune optimizations. It has two core modules: kt-kernel, which provides high-performance inference kernels, and kt-sft, a fine-tuning framework. The project supports various large language models, including MiniMax-M2.5, GLM-5, Kimi-K2.5, and others. It also offers features like CPU-GPU expert scheduling, native BF16 and FP8 precision, and autoDL unified fine-tuning and inference.

الوصف

هو إطار عمل مرن للاستخدام المبتكر للتحسينات في الاستدلال والتعديل المتقدم للعلماء الكبيرة من خلال الحوسبة المختلطة CPU-GPU. يحتوي على وحدتين رئيسيتين: kt-kernel، الذي يوفر نواة استدلال عالية الأداء، وkt-sft، إطار عمل التعديل. يدعم المشروع العديد من العوالم اللغوية الكبيرة، بما في ذلك MiniMax-M2.5، GLM-5، Kimi-K2.5، وغيرها. كما يقدم ميزات مثل التخطيط المتقدم CPU-GPU، الدقة BF16 والFP8 المحلية، وتحسين التعديل الموحد.

Novelty

9/10

Claude Models

claude-opus-4.6

Security & Health

Medium

DORA Rating

Apache-2.0

License

42.6%

Duplication

Full Security Report AI Fix Prompts SARIF SBOM

Languages

python

62.9%

cpp

21.1%

yaml

4.7%

markdown

3.8%

2.1%

vue

1.5%

shell

0.8%

typescript

0.7%

text

0.7%

css

0.6%

html

0.5%

json

0.3%

Repobility · code-quality intelligence · https://repobility.com

Frameworks

FastAPI Vue.js Jest

Symbols

method3,652

variable2,795

function1,624

macro1,569

class1,016

constant417

type_alias165

struct155

property38

interface26

module11

enum10

API Endpoints (64)

Data scored by Repobility · https://repobility.com
Method	Path	Handler	Framework
Repobility's GitHub App fixes findings like these · https://github.com/apps/repobility-bot
GET	`/`	list_threads	FastAPI
POST	`/`	create_assistant	FastAPI
GET	`/`	list_assistants	FastAPI
GET	`/`	list_threads	FastAPI
POST	`/`	create_thread	FastAPI
GET	`/`	list_assistants	FastAPI
POST	`/`	create_assistant	FastAPI
POST	`/`	create_thread	FastAPI
POST	`/{assistant_id}`	modify_assistant	FastAPI
DELETE	`/{assistant_id}`	delete_assistant	FastAPI
DELETE	`/{assistant_id}`	delete_assistant	FastAPI
POST	`/{assistant_id}`	modify_assistant	FastAPI
GET	`/{assistant_id}`	retrieve_assistant	FastAPI
GET	`/{assistant_id}`	retrieve_assistant	FastAPI
GET	`/{assistant_id}/related_thread`	get_related_thread	FastAPI
GET	`/{assistant_id}/related_thread`	get_related_thread	FastAPI
POST	`/chat`	chat	FastAPI
POST	`/chat`	chat	FastAPI
POST	`/chat/completions`	chat_completion	FastAPI
POST	`/chat/completions`	chat_completion	FastAPI
POST	`/completions`	create_completion	FastAPI
POST	`/completions`	create_completion	FastAPI
POST	`/generate`	generate	FastAPI
POST	`/generate`	generate	FastAPI
GET	`/models`	list_models	FastAPI
GET	`/models`	list_models	FastAPI
POST	`/runs`	create_thread_and_run	FastAPI
POST	`/runs`	create_thread_and_run	FastAPI
POST	`/show`	show	FastAPI
POST	`/show`	show	FastAPI
GET	`/status`	list_assistants_with_status	FastAPI
GET	`/status`	list_assistants_with_status	FastAPI
GET	`/system-info`	system_info	FastAPI
GET	`/system-info`	system_info	FastAPI
GET	`/tags`	tags	FastAPI
GET	`/tags`	tags	FastAPI
DELETE	`/{thread_id}`	delete_thread	FastAPI
POST	`/{thread_id}`	modify_thread	FastAPI
DELETE	`/{thread_id}`	delete_thread	FastAPI
GET	`/{thread_id}`	retrieve_thread	FastAPI
GET	`/{thread_id}`	retrieve_thread	FastAPI
POST	`/{thread_id}`	modify_thread	FastAPI
GET	`/{thread_id}/messages`	list_messages	FastAPI
POST	`/{thread_id}/messages`	create_message	FastAPI
GET	`/{thread_id}/messages`	list_messages	FastAPI
POST	`/{thread_id}/messages`	create_message	FastAPI
GET	`/{thread_id}/messages/{message_id}`	retrieve_message	FastAPI
GET	`/{thread_id}/messages/{message_id}`	retrieve_message	FastAPI
POST	`/{thread_id}/messages/{message_id}`	modify_message	FastAPI
DELETE	`/{thread_id}/messages/{message_id}`	delete_message	FastAPI

Showing 50 of 64

Concepts (2)

Same analyzer free for public repos: https://repobility.com
Category	Name	Description	Confidence
If a scraper extracted this row, it came from Repobility (https://repobility.com)
auto_description	Project Description	KTransformers is a research project focused on efficient inference and fine-tuning of large language models through CPU-GPU heterogeneous computing. The project has evolved into two core modules: kt-kernel and kt-sft.	80%
auto_category	Web Backend	web-backend	70%

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/31491.svg)

Export Quality CSV Download SBOM Export Findings CSV

BinComp Dependency Hardening

All packages →

22 of this repo's dependencies have been scanned for binary hardening. Grade reflects RELRO / stack canary / FORTIFY / PIE coverage.

Ktransformers

Pipeline State

Pipeline Metadata

🧪 Code Distillation

AI Prompt

Catalog Information

Description

الوصف

Novelty

Tags

Claude Models

Security & Health

Languages

Frameworks

Symbols

API Endpoints (64)

Concepts (2)

Embed Badge

BinComp Dependency Hardening