Anonymcp

B+ (86) · completed
API
cli / python · tiny
Files: 38
LOC: 2,235
Frameworks: 1
Languages: 4

Pipeline State

completed
Run ID: #342127
Phase: done
Progress: 1%
Started
Finished
2026-04-13 01:31:02
LLM tokens: 0

Pipeline Metadata

Stage: Skipped
Decision: skip_scaffold_dup
Novelty: 36.44
Framework unique
Isolation
Last stage change: 2026-04-16 18:15:42
Deduplication group #48377
Member of a group with 1 similar repo; canonical #73174
Top concepts (2)
Project Description, Testing

AI Prompt

Create a command-line interface (CLI) tool in Python that acts as a data governance layer for AI workflows. I need it to detect over 50 types of PII, such as emails and SSNs, and provide confidence scores. The tool must support anonymization using methods like redaction, masking, or hashing, and also classify text into sensitivity levels like PUBLIC, INTERNAL, CONFIDENTIAL, or RESTRICTED. Crucially, it needs to be policy-driven using YAML files to configure rules and must generate a full audit trail of all governance actions.
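The detection and anonymization behavior requested in the prompt can be illustrated with a minimal standard-library sketch. It is not the project's implementation (the catalog entry says the project relies on Microsoft Presidio): the two regex patterns, the fixed confidence scores, and the names `PATTERNS`, `Finding`, `detect`, and `anonymize` are all assumptions made for illustration.

```python
import hashlib
import re
from dataclasses import dataclass

# Hypothetical patterns and confidence scores, for illustration only.
# The prompt asks for 50+ entity types; only two are sketched here.
PATTERNS = {
    "EMAIL": (re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"), 0.9),
    "US_SSN": (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), 0.85),
}


@dataclass(frozen=True)
class Finding:
    entity_type: str
    start: int
    end: int
    score: float


def detect(text: str) -> list[Finding]:
    """Return all pattern matches, ordered by position in the text."""
    findings = [
        Finding(name, m.start(), m.end(), score)
        for name, (pattern, score) in PATTERNS.items()
        for m in pattern.finditer(text)
    ]
    return sorted(findings, key=lambda f: f.start)


def anonymize(text: str, method: str = "redact") -> str:
    """Replace each finding using 'redact', 'mask', or 'hash'."""
    out = text
    # Work right to left so earlier offsets stay valid after each replacement.
    for f in sorted(detect(text), key=lambda f: f.start, reverse=True):
        value = text[f.start:f.end]
        if method == "redact":
            replacement = f"<{f.entity_type}>"
        elif method == "mask":
            replacement = "*" * len(value)
        elif method == "hash":
            replacement = hashlib.sha256(value.encode("utf-8")).hexdigest()[:12]
        else:
            raise ValueError(f"unknown method: {method}")
        out = out[:f.start] + replacement + out[f.end:]
    return out
```

Calling `anonymize("Contact jane.doe@example.com, SSN 123-45-6789.")` with the default method returns `Contact <EMAIL>, SSN <US_SSN>.`. A truncated, unsalted SHA-256 digest keeps the example short but is not a strong pseudonymization scheme for low-entropy values such as SSNs.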
python cli data-governance pii security ai-workflow anonymization policy
Generated by gemma4:latest

Catalog Information

An API server that detects, anonymizes, classifies, and audits PII data using Microsoft Presidio.

Description

This project implements an API server that provides comprehensive data governance capabilities, including PII detection, anonymization, classification, and audit logging. It leverages Microsoft Presidio to identify sensitive patterns in text and files, and applies configurable masking or redaction rules. Clients can submit data via REST endpoints and receive detailed reports, audit trails, and anonymized outputs. The server is designed for organizations that need to enforce privacy compliance across logs, databases, and data streams. It offers a unified interface that simplifies integration with existing data pipelines and compliance workflows.
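The classification and audit steps mentioned above could look roughly like the following sketch. It uses the four sensitivity levels named in the AI prompt; the entity-to-level mapping in `POLICY`, the default level, and the record fields are hypothetical, and the real project is described as loading its rules from YAML files rather than from a hard-coded dictionary.

```python
import json
from datetime import datetime, timezone
from enum import IntEnum


class Sensitivity(IntEnum):
    PUBLIC = 0
    INTERNAL = 1
    CONFIDENTIAL = 2
    RESTRICTED = 3


# Hypothetical policy: which detected entity type implies which level.
POLICY = {
    "EMAIL": Sensitivity.CONFIDENTIAL,
    "PHONE_NUMBER": Sensitivity.CONFIDENTIAL,
    "US_SSN": Sensitivity.RESTRICTED,
}


def classify(entity_types, default=Sensitivity.INTERNAL):
    """Return the highest level implied by the detected entity types.

    Unknown entity types fall back to `default`; text with no
    detected entities is treated as PUBLIC.
    """
    levels = [POLICY.get(e, default) for e in entity_types]
    return max(levels, default=Sensitivity.PUBLIC)


def audit_record(action, entity_types, level):
    """Serialize one governance action as a JSON line for an audit log."""
    return json.dumps({
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "action": action,
        "entity_types": sorted(entity_types),
        "sensitivity": level.name,
    })
```

Taking the maximum level is one conservative design choice: a single restricted entity is enough to mark the whole text as RESTRICTED. Appending each `audit_record(...)` result to a file yields a JSON Lines audit trail.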

Description (translated from Arabic)

This project provides a server that supports data governance by detecting personally identifiable information (PII), anonymizing it, classifying it, and logging audit operations. It relies on Microsoft's Presidio library to identify patterns that indicate sensitive data in text and files. Users can send requests through the API to identify the type of data, apply the appropriate anonymization actions, and obtain a detailed report of the results. The project logs every operation with time and user details, which makes it easier to build audit records for regulatory compliance. It aims to simplify privacy protection in organizations that handle large volumes of personal data. It differs from traditional solutions in its focus on seamless integration with existing governance systems and in offering a unified interface for operations. It provides an improved user experience through a built-in command-line interface and attractive result rendering with the Rich library.

Novelty

6/10

Tags

pii-detection data-anonymization data-classification audit-logging privacy-compliance presidio-integration

Technologies

huggingface pydantic rich

Claude Models

claude-opus-4.6

Quality Score

B+ (86.3/100)
Structure: 90
Code Quality: 99
Documentation: 70
Testing: 75
Practices: 79
Security: 100
Dependencies: 60

Strengths

  • CI/CD pipeline configured (github_actions)
  • Good test coverage (39% test-to-source ratio)
  • Code linting configured (possibly ruff)
  • Consistent naming conventions (snake_case)
  • Good security practices: no major issues detected
  • Containerized deployment (Docker)
  • Properly licensed project

Weaknesses

  • 103 duplicate lines detected; consider DRY refactoring

Security & Health

Tech Debt (D): 4.6h
OWASP (100%): A
Quality Gate: PASS
Risk (5): A
License: Apache-2.0
Duplication: 1.7%
Provenance: Repobility (https://repobility.com); every score reproducible from /scan/

Languages

python: 72.1%
markdown: 19.1%
yaml: 5.1%
toml: 3.7%

Frameworks

pytest

Concepts (2)

| Category | Name | Description | Confidence |
|---|---|---|---|
| auto_description | Project Description | An MCP server for data governance: PII detection, anonymization, classification, and audit powered by Microsoft Presidio. | 80% |
| auto_category | Testing | testing | 70% |

Quality Timeline

1 quality score recorded.

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/66168.svg)