Isolation Forest

B 82 completed
Other
library / scala · small
69
Files
5,879
LOC
0
Frameworks
6
Languages

Pipeline State

completed
Run ID
#1479126
Phase
done
Progress
0%
Started
2026-04-16 11:16:56
Finished
2026-04-16 11:16:56
LLM tokens
0

Pipeline Metadata

Stage
Cataloged
Decision
proceed
Novelty
56.94
Framework unique
Isolation
Last stage change
2026-05-10 03:34:46
Deduplication group #56383
Member of a group with 2 similar repo(s) — canonical #1450395 view group →
Repobility · code-quality intelligence · https://repobility.com

AI Prompt

Create a library implementation for the Isolation Forest outlier detection algorithm. I need it to support both the standard Isolation Forest and an Extended Isolation Forest variant that uses random hyperplane splits. The core functionality should allow for distributed training and scoring using Scala and Spark data structures, ideally integrating with Spark's ML library for features like Pipelines. Additionally, please include support for saving and loading trained models, and the ability to export a trained model to ONNX format for cross-platform inference, with a Python example for using the ONNX model.
scala spark machine-learning outlier-detection isolation-forest onnx library distributed-computing
Generated by gemma4:latest

Catalog Information

Create a library implementation for the Isolation Forest outlier detection algorithm. I need it to support both the standard Isolation Forest and an Extended Isolation Forest variant that uses random hyperplane splits. The core functionality should allow for distributed training and scoring using Scala and Spark data structures, ideally integrating with Spark's ML library for features like Pipelines. Additionally, please include support for saving and loading trained models, and the ability to e

Tags

scala spark machine-learning outlier-detection isolation-forest onnx library distributed-computing

Quality Score

B
82.1/100
Structure
82
Code Quality
80
Documentation
69
Testing
85
Practices
84
Security
100
Dependencies
80

Strengths

  • CI/CD pipeline configured (github_actions)
  • Good test coverage (86% test-to-source ratio)
  • Code linting configured (ruff (possible))
  • Good security practices — no major issues detected
  • Properly licensed project

Weaknesses

  • 321 duplicate lines detected — consider DRY refactoring

Languages

scala
74.1%
python
14.5%
markdown
7.7%
yaml
3.2%
text
0.3%
toml
0.2%

Frameworks

None detected

Symbols

method18
class1
function1
variable1

Quality Timeline

1 quality score recorded.

View File Metrics

Embed Badge

Add to your README:

![Quality](https://repos.aljefra.com/badge/1177708.svg)
Quality BadgeSecurity Badge
Export Quality CSVDownload SBOMExport Findings CSV

BinComp Dependency Hardening

All packages →
2 of this repo's dependencies have been scanned for binary hardening. Grade reflects RELRO / stack canary / FORTIFY / PIE coverage.
Fnumpy2.4.4 · 6,596 gadgets · risk 0.0Nsetuptools82.0.1 · 0 gadgets · risk 0.0