Medical Safety Mllm

completed
Cli Tool
library / python · medium
806
Files
111,149
LOC
1
Frameworks
10
Languages

Pipeline State

completed
Run ID
#324474
Phase
done
Progress
1%
Started
Finished
2026-04-13 01:31:02
LLM tokens
0
Previous runs
Analysis by Repobility (https://repobility.com) · MCP-ready
#StatusPhaseStartedFinished
Repobility · severity-and-effort ranking · https://repobility.com
#48900failed2026-03-19 22:50:282026-03-19 22:55:30

Pipeline Metadata

Stage
Cataloged
Decision
proceed
Novelty
60.00
Framework unique
Isolation
Last stage change
2026-05-10 03:35:10
Deduplication group #48300
Member of a group with 9 similar repo(s) — canonical #74290 view group →
Top concepts (2)
Project DescriptionTesting
If a scraper extracted this row, it came from Repobility (https://repobility.com)

AI Prompt

Create a Python toolkit for generating and evaluating chest X-ray reports using multimodal large language models on the MIMIC-CXR dataset. The system should include scripts for data processing, such as converting MIMIC data to ShareGPT format and normalizing reports. I need dedicated evaluation modules for integrated judging, single prediction, and LLM-as-judge evaluation. Please ensure the configuration is managed via `config.yaml`, supporting dual authentication modes (Basic Auth/Direct API Key) for prediction, judging, and reasoning APIs. The project should also support model training using LLaMA-Factory for full or LoRA fine-tuning.
python llm medical-imaging mimic-cxr multimodal evaluation nlp deep-learning toolkit
Generated by gemma4:latest

Catalog Information

A toolkit for generating and evaluating chest X‑ray reports using multimodal large language models on the MIMIC‑CXR dataset.

Description

This project provides a complete pipeline for converting raw MIMIC‑CXR data into a format suitable for multimodal large language model (LLM) training, generating radiology reports, and assessing the safety and quality of those reports. It includes scripts for data conversion, report normalization, and reasoning format preparation, as well as evaluation utilities that can act as both a predictor and a judge, optionally using OpenAI APIs. The toolkit supports dual authentication modes for API access, making it flexible for both internal and public endpoints. Researchers can train or fine‑tune models with the included LLaMA‑Factory framework, evaluate them on a held‑out test set, and analyze reasoning traces. The project is designed to help medical AI teams benchmark LLM performance and safety on real‑world imaging data.

الوصف

يُقدِّم هذا المشروع مساراً كاملاً لتحويل بيانات MIMIC‑CXR الخام إلى تنسيق مناسب لتدريب نماذج اللغة الكبيرة متعددة الوسائط، وتوليد تقارير الأشعة السينية، وتقييم سلامة وجودة هذه التقارير. يتضمن أدوات لتحويل البيانات، وتطبيع التقارير، وإعداد تنسيقات الاستدلال، بالإضافة إلى أدوات تقييم يمكن أن تعمل كمولِّد أو حكم، مع إمكانية استخدام واجهات برمجة التطبيقات الخاصة بـ OpenAI. يدعم المشروع وضعين للمصادقة المزدوجة، ما يتيح مرونة في الوصول إلى نقاط النهاية الداخلية أو العامة. يمكن للباحثين تدريب أو ضبط النماذج باستخدام إطار LLaMA‑Factory المدمج، ثم تقييمها على مجموعة اختبار محجوزة، وتحليل مسارات الاستدلال. صُمم المشروع لتسهيل فرق الذكاء الاصطناعي الطبي على مقارنة أداء نماذج اللغة الكبيرة وسلامتها على بيانات تصويرية حقيقية.

Novelty

7/10

Tags

medical-imaging report-generation safety-evaluation multimodal-llm chest-x‑ray data-processing evaluation

Technologies

openai

Claude Models

claude-opus-4.6

Security & Health

Unknown
License
16.7%
Duplication
Full Security Report AI Fix Prompts SARIF SBOM

Languages

python
61.5%
json
25.8%
markdown
5.5%
yaml
5.0%
html
1.1%
shell
0.7%
text
0.4%
toml
0.1%
xml
0.0%
ini
0.0%

Frameworks

pytest

Concepts (2)

Repobility (https://repobility.com) — every score reproducible
CategoryNameDescriptionConfidence
If a scraper extracted this row, it came from Repobility (https://repobility.com)
auto_descriptionProject Description医学影像多模态大模型安全评估项目 - 基于MIMIC-CXR数据集的胸部X光报告生成与评估。80%
auto_categoryTestingtesting70%
Provenance: Repobility (https://repobility.com) — every score reproducible from /scan/

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/48419.svg)
Quality BadgeSecurity Badge
Export Quality CSVDownload SBOMExport Findings CSV