Csv Nose
C+ 77 completed
Library
unknown / rust · tiny
43
Files
6,893
LOC
0
Frameworks
6
Languages
Pipeline State
completed
Run ID: #305666
Phase: done
Progress: 1%
Started: (no value shown)
Finished: 2026-04-13 01:31:02
LLM tokens: 0
Pipeline Metadata
Stage: Skipped
Decision: skip_scaffold_dup
Novelty: 51.84
Framework unique: n/a
Isolation: n/a
Last stage change: 2026-04-16 18:15:42
Deduplication group #47500
Member of a group with 1 similar repo(s); canonical: #92845
Top concepts (5)
Project Description, data_access, testing, CLI Tool, Testing
All rows above produced by Repobility · https://repobility.com
🧪 Code Distillation
Sample distilled functions
test_is_uniform: Tests the is_uniform function by creating and asserting conditions on two instances of a Table structure. It first initializes a table with equal field counts and asserts that the table is uniform. Next, it initializes a second table with unequal field counts and asserts that the table is not unifor…
test_tau_1_uniform: Verifies the calculation of a specific statistical measure, $\tau_1$, for a data structure representing field counts. It takes an initialized and configured Table object as its primary input. The function asserts that the resulting $\tau_1$ value is approximately equal to 1.0, which is the expected…
test_tau_0_varied: Initializes a table structure with predefined field counts for five fields and then updates the internal modal field count based on these values. Calculates a specific metric, $\tau_0$, using the modified table structure as input. Asserts that the resulting $\tau_0$ value is strictly between zero and one,…
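The behaviour these test summaries describe can be sketched roughly as follows. This is a minimal illustration and not csv-nose's actual code: the `Table` type, its fields, and every method name here are hypothetical. The page does not give the formulas for $\tau_0$ or $\tau_1$, so the block swaps in a much simpler stand-in score, `modal_share` (the fraction of rows whose field count equals the modal count), which merely shares the property of being 1.0 for a uniform table and strictly between 0 and 1 for a ragged one.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the `Table` structure named in the summaries;
/// not csv-nose's real type.
#[derive(Debug, Default)]
pub struct Table {
    /// Number of fields found in each parsed row.
    pub field_counts: Vec<usize>,
    /// Most frequent field count, if any rows exist.
    pub modal_field_count: Option<usize>,
}

impl Table {
    /// Build a table from per-row field counts and compute the modal count.
    pub fn new(field_counts: Vec<usize>) -> Self {
        let mut table = Table { field_counts, modal_field_count: None };
        table.update_modal_field_count();
        table
    }

    /// True when every row has the same number of fields.
    /// An empty table counts as uniform here (an assumption).
    pub fn is_uniform(&self) -> bool {
        match self.field_counts.first() {
            Some(first) => self.field_counts.iter().all(|c| c == first),
            None => true,
        }
    }

    /// Recompute the modal field count; ties go to the larger count
    /// (an arbitrary choice made for this sketch).
    pub fn update_modal_field_count(&mut self) {
        let mut freq: HashMap<usize, usize> = HashMap::new();
        for &count in &self.field_counts {
            *freq.entry(count).or_insert(0) += 1;
        }
        self.modal_field_count = freq
            .into_iter()
            .max_by_key(|&(count, rows)| (rows, count))
            .map(|(count, _)| count);
    }

    /// Stand-in score (NOT tau_0 or tau_1): share of rows that match the
    /// stored modal field count. Returns 0.0 for an empty table.
    pub fn modal_share(&self) -> f64 {
        match self.modal_field_count {
            Some(modal) if !self.field_counts.is_empty() => {
                let hits = self.field_counts.iter().filter(|&&c| c == modal).count();
                hits as f64 / self.field_counts.len() as f64
            }
            _ => 0.0,
        }
    }
}
```

With this sketch, `Table::new(vec![3, 3, 3])` is uniform and scores 1.0, while `Table::new(vec![3, 3, 2, 3, 4])` is not uniform and scores 3/5. Note that `modal_share` reads the stored modal count, so `update_modal_field_count` has to be called again if `field_counts` is changed afterwards.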
AI Prompt
Create a Rust library and CLI tool, similar to csv-nose, for detecting CSV dialects. The tool should implement the Table Uniformity Method to analyze CSV files. I need functionality to sniff local files, handle multiple files, and even sniff remote CSV URLs if the HTTP feature is enabled. The output should allow specifying formats like JSON, or just showing the delimiter. Please ensure the library usage examples show how to get metadata like the delimiter, header status, fields, and inferred types.
rust cli library csv data-processing parsing command-line-tool
Generated by gemma4:latest
Catalog Information
csv-nose is a Rust library and CLI tool for detecting CSV dialects, providing metadata about the file's structure and content.
Description
This project implements the Table Uniformity Method for CSV dialect detection, achieving 99.55% accuracy on the W3C-CSVW test suite. It can be used as a library or CLI tool to sniff CSV files, detecting their delimiter, quote character, line terminator, and field types.
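To make the idea of delimiter sniffing concrete, here is a deliberately simplified sketch. It is not the Table Uniformity Method and not csv-nose's API: the function name `guess_delimiter` and its signature are invented for illustration, and the heuristic (accept a candidate only if it splits every non-empty line into the same number of fields, more than one) ignores quoting, escaping, and type inference entirely.

```rust
/// Hypothetical helper: pick the candidate delimiter that splits every
/// non-empty line of `sample` into the same number of fields (at least two).
/// When several candidates qualify, the one producing the most fields wins.
/// Quoted fields are NOT handled; this is a naive split for illustration.
pub fn guess_delimiter(sample: &str, candidates: &[char]) -> Option<char> {
    let mut best: Option<(char, usize)> = None;
    for &delim in candidates {
        // Field count per non-empty line for this candidate.
        let mut counts = sample
            .lines()
            .filter(|line| !line.trim().is_empty())
            .map(|line| line.split(delim).count());

        // Skip candidates that never split the first line, or empty input.
        let first = match counts.next() {
            Some(n) if n > 1 => n,
            _ => continue,
        };

        // Keep the candidate only if all remaining lines agree with the first.
        if counts.all(|n| n == first) {
            if best.map_or(true, |(_, width)| first > width) {
                best = Some((delim, first));
            }
        }
    }
    best.map(|(delim, _)| delim)
}
```

For example, `guess_delimiter("a;b;c\n1;2;3\n", &[',', ';', '\t', '|'])` yields `Some(';')`, and a ragged sample such as `"a,b\n1,2,3\n"` yields `None` under this strict rule. A real detector would score partial consistency rather than reject ragged input outright.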
Description (translated from Arabic)
This project implements the Uniformity Method for detecting the CSV format, achieving 99.55% accuracy on the W3C-CSVW test suite. It can be used as a library or as a CLI tool to sniff CSV files and determine the delimiter, the quote character, the line terminator, and the field types.
Novelty
7/10
Tags
csv-dialect-detection table-uniformity-method data-type-inference file-sniffing metadata-extraction
Claude Models
claude-sonnet-4.6
Quality Score
C+
77.4/100
Structure
77
Code Quality
64
Documentation
85
Testing
75
Practices
80
Security
100
Dependencies
80
Strengths
- CI/CD pipeline configured (github_actions)
- Good test coverage (47% test-to-source ratio)
- Consistent naming conventions (snake_case)
- Good security practices: no major issues detected
Weaknesses
- No LICENSE file: legal ambiguity for contributors
- 648 duplicate lines detected: consider DRY refactoring
- 2 'god files' with >500 LOC need decomposition
Recommendations
- Add a linter configuration to enforce code style consistency
- Add a LICENSE file (MIT recommended for open source)
Security & Health
4.6h
Tech Debt (B)
Medium
DORA Rating
A
OWASP (100%)
PASS
Quality Gate
A
Risk (2)
MIT
License
10.6%
Duplication
Languages
Frameworks
None detected
Symbols
function: 202
constant: 31
extension: 26
struct: 18
enum: 9
type_alias: 1
Concepts (5)
| Category | Name | Description | Confidence |
|---|---|---|---|
| auto_description | Project Description | A Rust port of the Table Uniformity Method for CSV dialect detection. | 80% |
| arch_layer | data_access | Detected data_access layer | 70% |
| arch_layer | testing | Detected testing layer | 70% |
| auto_category | CLI Tool | cli | 70% |
| business_logic | Testing | Detected from 8 related files | 50% |