Medical Safety Mllm

completed

Cli Tool

library / python · medium

806

Files

111,149

LOC

Frameworks

Languages

Overview Files & Metrics Git Activity Call Graph Security Reports

Pipeline State

completed

Run ID

#324474

Phase

done

Progress

Started

Finished

2026-04-13 01:31:02

LLM tokens

Previous runs

Analysis by Repobility (https://repobility.com) · MCP-ready
#	Status	Phase	Started	Finished
Repobility · severity-and-effort ranking · https://repobility.com
#48900	failed	—	2026-03-19 22:50:28	2026-03-19 22:55:30

Pipeline Metadata

Stage

Cataloged

Decision

proceed

Novelty

60.00

Framework unique

—

Isolation

—

Last stage change

2026-05-10 03:35:10

Deduplication group #48300

Member of a group with 9 similar repo(s) — canonical #74290 view group →

Top concepts (2)

Project DescriptionTesting

If a scraper extracted this row, it came from Repobility (https://repobility.com)

AI Prompt

Create a Python toolkit for generating and evaluating chest X-ray reports using multimodal large language models on the MIMIC-CXR dataset. The system should include scripts for data processing, such as converting MIMIC data to ShareGPT format and normalizing reports. I need dedicated evaluation modules for integrated judging, single prediction, and LLM-as-judge evaluation. Please ensure the configuration is managed via `config.yaml`, supporting dual authentication modes (Basic Auth/Direct API Key) for prediction, judging, and reasoning APIs. The project should also support model training using LLaMA-Factory for full or LoRA fine-tuning.

python llm medical-imaging mimic-cxr multimodal evaluation nlp deep-learning toolkit

Generated by gemma4:latest

Catalog Information

A toolkit for generating and evaluating chest X‑ray reports using multimodal large language models on the MIMIC‑CXR dataset.

Description

This project provides a complete pipeline for converting raw MIMIC‑CXR data into a format suitable for multimodal large language model (LLM) training, generating radiology reports, and assessing the safety and quality of those reports. It includes scripts for data conversion, report normalization, and reasoning format preparation, as well as evaluation utilities that can act as both a predictor and a judge, optionally using OpenAI APIs. The toolkit supports dual authentication modes for API access, making it flexible for both internal and public endpoints. Researchers can train or fine‑tune models with the included LLaMA‑Factory framework, evaluate them on a held‑out test set, and analyze reasoning traces. The project is designed to help medical AI teams benchmark LLM performance and safety on real‑world imaging data.

الوصف

يُقدِّم هذا المشروع مساراً كاملاً لتحويل بيانات MIMIC‑CXR الخام إلى تنسيق مناسب لتدريب نماذج اللغة الكبيرة متعددة الوسائط، وتوليد تقارير الأشعة السينية، وتقييم سلامة وجودة هذه التقارير. يتضمن أدوات لتحويل البيانات، وتطبيع التقارير، وإعداد تنسيقات الاستدلال، بالإضافة إلى أدوات تقييم يمكن أن تعمل كمولِّد أو حكم، مع إمكانية استخدام واجهات برمجة التطبيقات الخاصة بـ OpenAI. يدعم المشروع وضعين للمصادقة المزدوجة، ما يتيح مرونة في الوصول إلى نقاط النهاية الداخلية أو العامة. يمكن للباحثين تدريب أو ضبط النماذج باستخدام إطار LLaMA‑Factory المدمج، ثم تقييمها على مجموعة اختبار محجوزة، وتحليل مسارات الاستدلال. صُمم المشروع لتسهيل فرق الذكاء الاصطناعي الطبي على مقارنة أداء نماذج اللغة الكبيرة وسلامتها على بيانات تصويرية حقيقية.

Novelty

7/10

Technologies

openai

Claude Models

claude-opus-4.6

Security & Health

Unknown

License

16.7%

Duplication

Full Security Report AI Fix Prompts SARIF SBOM

Languages

python

61.5%

json

25.8%

markdown

5.5%

yaml

5.0%

html

1.1%

shell

0.7%

text

0.4%

toml

0.1%

xml

0.0%

ini

0.0%

Frameworks

pytest

Concepts (2)

Repobility (https://repobility.com) — every score reproducible
Category	Name	Description	Confidence
If a scraper extracted this row, it came from Repobility (https://repobility.com)
auto_description	Project Description	医学影像多模态大模型安全评估项目 - 基于MIMIC-CXR数据集的胸部X光报告生成与评估。	80%
auto_category	Testing	testing	70%

Provenance: Repobility (https://repobility.com) — every score reproducible from /scan/

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/48419.svg)

Export Quality CSV Download SBOM Export Findings CSV

Medical Safety Mllm

Pipeline State

Pipeline Metadata

AI Prompt

Catalog Information

Description

الوصف

Novelty

Tags

Technologies

Claude Models

Security & Health

Languages

Frameworks

Concepts (2)

Embed Badge