Transmutation

C+ 76 completed
Library
containerized / rust · small
Files: 153
LOC: 22,146
Frameworks: 0
Languages: 8

Pipeline State

completed
Run ID
#302856
Phase
done
Progress
1%
Started
Finished
2026-04-13 01:31:02
LLM tokens
0

Pipeline Metadata

Stage
Cataloged
Decision
proceed
Novelty
77.33
Framework unique
Isolation
Last stage change
2026-05-10 03:34:57
Deduplication group #50210
Member of a group with 3 similar repos; canonical repo is #76081
Top concepts (9)
Repository, Project Description, infrastructure, Data/ML, testing, Factory, Testing, File Management, Database

AI Prompt

Create a high-performance document conversion engine, similar to Transmutation, written in pure Rust. This tool needs to transform various file formats into optimized text and image outputs suitable for LLM processing and vector embeddings. Specifically, it must support converting PDF, DOCX, XLSX, PPTX, HTML, XML, TXT, CSV/TSV, and RTF/ODT (even if in beta). For image inputs like JPG/JPEG and PNG, it should use OCR via Tesseract to output Markdown and JSON. The goal is to achieve speed comparable to or better than Docling, focusing on generating Markdown, JSON, and image outputs.
rust document-conversion llm embeddings pdf docx image-processing ocr high-performance
Generated by gemma4:latest
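
The format list in the prompt above suggests a first step of mapping an input file to a known format before choosing a converter. The following is a minimal sketch of that step, not Transmutation's actual API: the `InputFormat` enum, its `from_path`, `needs_ocr`, and `is_beta` methods, and the extension-based detection are all assumptions made for illustration.

```rust
use std::path::Path;

/// Input formats named in the prompt (hypothetical type, not from the real crate).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum InputFormat {
    Pdf,
    Docx,
    Xlsx,
    Pptx,
    Html,
    Xml,
    Txt,
    Csv,
    Tsv,
    Rtf,
    Odt,
    Jpeg,
    Png,
}

impl InputFormat {
    /// Guess the format from the file extension, ignoring ASCII case.
    /// Returns `None` for a missing or unrecognized extension.
    fn from_path(path: &Path) -> Option<Self> {
        let ext = path.extension()?.to_str()?.to_ascii_lowercase();
        Some(match ext.as_str() {
            "pdf" => Self::Pdf,
            "docx" => Self::Docx,
            "xlsx" => Self::Xlsx,
            "pptx" => Self::Pptx,
            "html" | "htm" => Self::Html,
            "xml" => Self::Xml,
            "txt" => Self::Txt,
            "csv" => Self::Csv,
            "tsv" => Self::Tsv,
            "rtf" => Self::Rtf,
            "odt" => Self::Odt,
            "jpg" | "jpeg" => Self::Jpeg,
            "png" => Self::Png,
            _ => return None,
        })
    }

    /// Image inputs are the ones the prompt routes through OCR.
    fn needs_ocr(self) -> bool {
        matches!(self, Self::Jpeg | Self::Png)
    }

    /// RTF and ODT are the formats the prompt allows to ship as beta.
    fn is_beta(self) -> bool {
        matches!(self, Self::Rtf | Self::Odt)
    }
}
```

A caller could use `InputFormat::from_path(Path::new("scan.jpg"))` to decide whether to invoke an OCR backend or a text extractor. Extension matching is only the simplest option; a production converter would likely also inspect file contents, since extensions can be wrong.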

Catalog Information

Transmutation is a high-performance document conversion engine for AI/LLM embeddings, designed to transform various file formats into optimized text and image outputs suitable for LLM processing and vector embeddings.

Description

Transmutation is a pure Rust document conversion engine that transforms various file formats into optimized text and image outputs. It's designed for LLM processing and vector embeddings, offering superior speed, lower memory usage, and zero runtime dependencies. The project aims to provide a high-performance alternative to Docling, with a goal of achieving 95% similarity in precision mode.

Description (translated from Arabic)

Transmutation is a high-speed document conversion engine for AI/LLM embeddings, designed to convert various document formats into optimized text and image outputs. It is intended for machine learning and semantic analysis workloads, and offers better performance than its competitor Docling, lower memory usage, and no runtime library dependencies.

Novelty

9/10

Tags

document-conversion ai-llm-embeddings high-performance pure-rust zero-runtime-dependencies optimized-text-and-image-outputs

Technologies

serde tokio

Claude Models

claude-opus-4.5 claude-opus-4.6

Quality Score

C+ (76.5/100)
Structure: 81
Code Quality: 64
Documentation: 86
Testing: 65
Practices: 78
Security: 100
Dependencies: 80

Strengths

  • CI/CD pipeline configured (github_actions)
  • Consistent naming conventions (snake_case)
  • Good security practices: no major issues detected
  • Containerized deployment (Docker)
  • Properly licensed project

Weaknesses

  • 1896 duplicate lines detected; consider DRY refactoring
  • 2 'god files' with >500 LOC need decomposition

Recommendations

  • Add a linter configuration to enforce code style consistency
  • Address 25 TODO/FIXME items; consider tracking them as issues

Security & Health

Tech Debt (B): 12.8h
DORA Rating: High
OWASP (100%): A
Quality Gate: PASS
Risk (1): A
License: MIT
Duplication: 9.4%
Generated by Repobility's multi-pass static-analysis pipeline (https://repobility.com)

Languages

rust: 64.0%
markdown: 28.8%
shell: 2.8%
yaml: 1.7%
toml: 0.9%
cpp: 0.9%
text: 0.6%
c: 0.2%

Frameworks

None detected

Symbols

function: 620
extension: 112
struct: 94
enum: 16
constant: 15
type_alias: 12
trait: 3
macro: 2

Concepts (9)

Category | Name | Description | Confidence
design_pattern | Repository | Found repository-named files | 80%
auto_description | Project Description | High-performance document conversion engine for AI/LLM embeddings | 80%
arch_layer | infrastructure | Detected infrastructure layer | 70%
auto_category | Data/ML | data-ml | 70%
arch_layer | testing | Detected testing layer | 70%
design_pattern | Factory | Found factory/create_ naming patterns | 60%
business_logic | Testing | Detected from 10 related files | 50%
business_logic | File Management | Detected from 3 related files | 50%
business_logic | Database | Detected from 5 related files | 50%

Quality Timeline

1 quality score recorded.


Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/26680.svg)

BinComp Dependency Hardening

1 of this repo's dependencies has been scanned for binary hardening. The grade reflects RELRO, stack canary, FORTIFY, and PIE coverage.
N · markdown 3.10.2 · 0 gadgets · risk 787.5