Transfermarkt Datasets

C+ 76 completed
Data Tool
infrastructure / python · small
Files: 113
LOC: 5,611
Frameworks: 1
Languages: 7

Pipeline State

completed
Run ID: #297505
Phase: done
Progress: 1%
Started: (blank)
Finished: 2026-04-13 01:31:02
LLM tokens: 0

Pipeline Metadata

Stage: Cataloged
Decision: proceed
Novelty: 70.24
Framework unique
Isolation
Last stage change: 2026-05-10 03:34:36
Deduplication group #62121
Member of a group with 1 similar repo(s). This repo is canonical.
Top concepts (9)
testing, presentation, business_logic, data_access, infrastructure, Database, File Management, Testing, Configuration

AI Prompt

Create a comprehensive Python data pipeline to extract, prepare, and publish Transfermarkt football datasets. I need the system to handle multiple data sources, structured into 10 distinct tables covering players, clubs, games, and transfers. The pipeline should be robust enough to be automatically updated and ideally use tools like dbt for transformation. Please structure the project to manage configuration via YAML and support data versioning.
python data-pipeline transfermarkt dbt sql yaml data-extraction sports-analytics
Generated by gemma4:latest

Catalog Information

This project extracts, prepares, publishes, and updates Transfermarkt datasets for use in various applications.

Description

Transfermarkt-datasets is a project that automates the process of extracting, preparing, publishing, and updating Transfermarkt datasets. It utilizes Python scripts to fetch data from Transfermarkt and store it in a structured format. The project leverages popular libraries such as pandas, plotly, and scipy for data manipulation and visualization. Streamlit is used to create an interactive interface for users to explore the datasets. This project aims to provide a convenient way to access and utilize Transfermarkt data.
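
As a rough illustration of how the structured tables described above might be combined, here is a minimal sketch that joins a player table to a club table and counts players per club. The column names (`player_id`, `current_club_id`, `club_id`) and the sample rows are assumptions made up for this example, not the project's confirmed schema. The project is described as using pandas; this sketch sticks to the standard library so it runs without extra installs.

```python
import csv
import io

# Hypothetical sample data; real table and column names may differ.
PLAYERS_CSV = """player_id,name,current_club_id
1,Player A,10
2,Player B,20
3,Player C,10
"""

CLUBS_CSV = """club_id,name
10,Club X
20,Club Y
"""


def read_table(text):
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def players_per_club(players, clubs):
    """Count players for each club name by joining on the club id."""
    club_names = {row["club_id"]: row["name"] for row in clubs}
    counts = {}
    for player in players:
        # Players whose club id is missing from the club table go to "unknown".
        club = club_names.get(player["current_club_id"], "unknown")
        counts[club] = counts.get(club, 0) + 1
    return counts


players = read_table(PLAYERS_CSV)
clubs = read_table(CLUBS_CSV)
summary = players_per_club(players, clubs)
print(summary)
```

With real data, the same join could be expressed with a pandas merge on the club identifier, assuming the published files expose a shared key between the two tables.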

Description (translated from Arabic)

This project analyzes, transforms, and organizes Transfermarkt data for use in multiple applications. It uses Python scripts to fetch data from the Transfermarkt website and store it in a structured format. The project relies on popular libraries such as pandas, plotly, scipy, and streamlit for data analysis and visualization. It aims to provide an easy way to access Transfermarkt data.

Novelty

5/10

Tags

data-extraction data-preparation data-publishing transfermarkt-data sports-data

Technologies

aws-sdk pandas plotly scipy streamlit

Claude Models

claude-opus-4.6 claude-sonnet-4.6

Quality Score

C+ (75.8/100)
Structure: 79
Code Quality: 100
Documentation: 67
Testing: 65
Practices: 58
Security: 65
Dependencies: 90

Strengths

  • CI/CD pipeline configured (github_actions)
  • Code linting configured (ruff (possible))
  • Consistent naming conventions (snake_case)
  • Low average code complexity: well-structured code
  • Containerized deployment (Docker)
  • Properly licensed project

Security & Health

Tech Debt (B): 4.6h
DORA Rating: Medium
OWASP (100%): A
Quality Gate: PASS
Risk (2): A
License: CC0-1.0
Duplication: 0.9%

Languages

python: 38.7%
yaml: 18.9%
sql: 18.0%
json: 13.3%
markdown: 8.1%
toml: 1.9%
shell: 1.0%

Frameworks

pytest

Symbols

variable: 93
method: 40
function: 27
class: 20
constant: 12
property: 8

Concepts (9)

Category | Name | Description | Confidence
arch_layer | testing | Detected testing layer | 70%
arch_layer | presentation | Detected presentation layer | 70%
arch_layer | business_logic | Detected business_logic layer | 70%
arch_layer | data_access | Detected data_access layer | 70%
arch_layer | infrastructure | Detected infrastructure layer | 70%
business_logic | Database | Detected from 26 related files | 50%
business_logic | File Management | Detected from 15 related files | 50%
business_logic | Testing | Detected from 8 related files | 50%
business_logic | Configuration | Detected from 4 related files | 50%

Quality Timeline

1 quality score recorded.


Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/21295.svg)