Csv Nose

C+ 77 completed
Library
unknown / rust · tiny
43
Files
6,893
LOC
0
Frameworks
6
Languages

Pipeline State

completed
Run ID
#305666
Phase
done
Progress
1%
Started
Finished
2026-04-13 01:31:02
LLM tokens
0

Pipeline Metadata

Stage
Skipped
Decision
skip_scaffold_dup
Novelty
51.84
Framework unique
Isolation
Last stage change
2026-04-16 18:15:42
Deduplication group #47500
Member of a group with 1 similar repo(s) — canonical #92845 view group →
Top concepts (5)
Project Descriptiondata_accesstestingCLI ToolTesting
All rows above produced by Repobility · https://repobility.com

AI Prompt

Create a Rust library and CLI tool, similar to csv-nose, for detecting CSV dialects. The tool should implement the Table Uniformity Method to analyze CSV files. I need functionality to sniff local files, handle multiple files, and even sniff remote CSV URLs if the HTTP feature is enabled. The output should allow specifying formats like JSON, or just showing the delimiter. Please ensure the library usage examples show how to get metadata like the delimiter, header status, fields, and inferred types.
rust cli library csv data-processing parsing command-line-tool
Generated by gemma4:latest

Catalog Information

csv-nose is a Rust library and CLI tool for detecting CSV dialects, providing metadata about the file's structure and content.

Description

This project implements the Table Uniformity Method for CSV dialect detection, achieving 99.55% accuracy on the W3C-CSVW test suite. It can be used as a library or CLI tool to sniff CSV files, detecting their delimiter, quote character, line terminator, and field types.

الوصف

هذا المشروع يimplement طريقة Uniformity Method لتحديد تنسيق CSV، وتحقيق دقة 99.55% على مجموعة اختبار W3C-CSVW. يمكن استخدامها ك مكتبة أو أداة CLI للكشف عن ملفات CSV، تحديد علامة التجزئة، حرف العلامة المزدوجة، علامة نهاية السطر، و أنواع الحقول.

Novelty

7/10

Tags

csv-dialect-detection table-uniformity-method data-type-inference file-sniffing metadata-extraction

Claude Models

claude-sonnet-4.6

Quality Score

C+
77.4/100
Structure
77
Code Quality
64
Documentation
85
Testing
75
Practices
80
Security
100
Dependencies
80

Strengths

  • CI/CD pipeline configured (github_actions)
  • Good test coverage (47% test-to-source ratio)
  • Consistent naming conventions (snake_case)
  • Good security practices \u2014 no major issues detected

Weaknesses

  • No LICENSE file \u2014 legal ambiguity for contributors
  • 648 duplicate lines detected \u2014 consider DRY refactoring
  • 2 'god files' with >500 LOC need decomposition

Recommendations

  • Add a linter configuration to enforce code style consistency
  • Add a LICENSE file (MIT recommended for open source)

Security & Health

4.6h
Tech Debt (B)
Medium
DORA Rating
A
OWASP (100%)
Repobility · code-quality intelligence platform · https://repobility.com
PASS
Quality Gate
A
Risk (2)
MIT
License
10.6%
Duplication
Full Security Report AI Fix Prompts SARIF SBOM

Languages

rust
69.5%
markdown
14.4%
text
14.2%
toml
0.7%
yaml
0.6%
json
0.5%

Frameworks

None detected

Symbols

function202
constant31
extension26
struct18
enum9
type_alias1

Concepts (5)

Repobility · code-quality scanner for AI-generated software · https://repobility.com
CategoryNameDescriptionConfidence
Repobility (the analyzer behind this table) · https://repobility.com
auto_descriptionProject DescriptionA Rust port of the Table Uniformity Method for CSV dialect detection.80%
arch_layerdata_accessDetected data_access layer70%
arch_layertestingDetected testing layer70%
auto_categoryCLI Toolcli70%
business_logicTestingDetected from 8 related files50%

Quality Timeline

1 quality score recorded.

View File Metrics
Source: Repobility analyzer · https://repobility.com

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/29500.svg)
Quality BadgeSecurity Badge
Export Quality CSVDownload SBOMExport Findings CSV