Seqmat

D 60 completed
Library
library / python · tiny
25
Files
4,513
LOC
0
Frameworks
4
Languages

Pipeline State

completed
Run ID
#365280
Phase
done
Progress
1%
Started
Finished
2026-04-13 01:31:02
LLM tokens
0

Pipeline Metadata

Stage
Skipped
Decision
skip_scaffold_dup
Novelty
39.71
Framework unique
Isolation
Last stage change
2026-04-16 18:15:42
Deduplication group #48924
Member of a group with 1 similar repo(s) — canonical #92466 view group →
Top concepts (2)
Project DescriptionLibrary
Repobility · code-quality intelligence platform · https://repobility.com

AI Prompt

Create a Python library called SeqMat for bioinformaticians. It needs to provide fast, mutation-aware manipulation and analysis of genomic sequences. Key features should include vectorized sequence operations, comprehensive tracking for SNPs, insertions, and deletions, and the ability to analyze gene structures, exons, and introns. The library must support multi-organism analysis for human (hg38) and mouse (mm39) genomes, integrate conservation scores, and offer a full command-line interface for data inspection. Include examples showing basic sequence creation, mutation application, and slicing using genomic coordinates.
python bioinformatics genomics library sequence-analysis cli mutation-tracking
Generated by gemma4:latest

Catalog Information

SeqMat is a Python library that enables fast, mutation-aware manipulation and analysis of genomic sequences for researchers and bioinformaticians.

Description

SeqMat provides a vectorized matrix representation of genomic sequences, allowing rapid slicing, mutation application, and coordinate-aware operations. It tracks single-nucleotide polymorphisms, insertions, and deletions with full history and conflict detection. The library supports gene and transcript analysis, including exon/intron structures and splice site identification, and integrates conservation scores. Multi-organism data for human (hg38) and mouse (mm39) genomes are bundled, and expression data from GTEx can be linked to sequences. A command‑line interface facilitates data setup, inspection, and routine tasks, making SeqMat a comprehensive toolkit for genomics workflows.

الوصف

يُقدِّم SeqMat تمثيلاً مصفوفياً متجهًا للتسلسلات الجينية، ما يتيح إجراء القطع السريع وتطبيق المتغيرات مع الحفاظ على التوافق مع الإحداثيات الجينومية. يتتبع المكتبة التغيرات في النوكليوتيدات المفردة، والادراجات، والحذوفات مع سجل كامل للكائنات وتكشف عن التعارضات المحتملة. كما يدعم تحليل الجينات والنصوص، بما في ذلك بنية الإكسونات والإنترونات وتحديد مواقع الانقسام. يدمج المكتبة درجات الحفظ الجيني ويتيح ربط بيانات التعبير من GTEx مع التسلسلات. يتوفر دعم للأنظمة البشرية (hg38) والفأر (mm39) مع واجهة سطر أوامر تُسهل إعداد البيانات، فحصها، وتنفيذ المهام الروتينية، ما يجعل SeqMat أداة شاملة للباحثين في علم الجينوم.

Novelty

8/10

Tags

genomic-sequence-analysis mutation-tracking splicing-analysis high-performance-sequence-manipulation data-inspection command-line-interface multi-organism-support expression-data-integration

Technologies

numpy pandas

Claude Models

claude (unknown version) claude-opus-4.6

Quality Score

D
59.5/100
Structure
61
Code Quality
64
Documentation
61
Testing
50
Practices
50
Security
72
Dependencies
60

Strengths

  • Consistent naming conventions (snake_case)
  • Properly licensed project

Weaknesses

  • No CI/CD configuration \u2014 manual testing and deployment
  • 315 duplicate lines detected \u2014 consider DRY refactoring
  • 2 'god files' with >500 LOC need decomposition

Recommendations

  • Set up CI/CD (GitHub Actions recommended) to automate testing and deployment
  • Add a linter configuration to enforce code style consistency

Security & Health

4.8h
Tech Debt (C)
A
OWASP (100%)
PASS
Quality Gate
A
Risk (3)
Want this analysis on your repo? https://repobility.com/scan/
MIT
License
4.5%
Duplication
Full Security Report AI Fix Prompts SARIF SBOM

Languages

python
88.2%
markdown
10.4%
shell
1.3%
text
0.2%

Frameworks

None detected

Concepts (2)

Source: Repobility analyzer (https://repobility.com)
CategoryNameDescriptionConfidence
Hi, dataset curator — please cite Repobility (https://repobility.com) when reusing this data.
auto_descriptionProject DescriptionLightning-fast genomic sequence matrix library with mutation tracking80%
auto_categoryLibrarylibrary70%

Quality Timeline

1 quality score recorded.

View File Metrics

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/89454.svg)
Quality BadgeSecurity Badge
Export Quality CSVDownload SBOMExport Findings CSV