Isolation Forest
B 82 completed
Other
library / scala · small
69
Files
5,879
LOC
0
Frameworks
6
Languages
Pipeline State
completedRun ID
#1479126Phase
doneProgress
0%Started
2026-04-16 11:16:56Finished
2026-04-16 11:16:56LLM tokens
0Pipeline Metadata
Stage
CatalogedDecision
proceedNovelty
56.94Framework unique
—Isolation
—Last stage change
2026-05-10 03:34:46Deduplication group #56383
Member of a group with 2 similar repo(s) — canonical #1450395 view group →
Repobility · code-quality intelligence · https://repobility.com
AI Prompt
Create a library implementation for the Isolation Forest outlier detection algorithm. I need it to support both the standard Isolation Forest and an Extended Isolation Forest variant that uses random hyperplane splits. The core functionality should allow for distributed training and scoring using Scala and Spark data structures, ideally integrating with Spark's ML library for features like Pipelines. Additionally, please include support for saving and loading trained models, and the ability to export a trained model to ONNX format for cross-platform inference, with a Python example for using the ONNX model.
scala spark machine-learning outlier-detection isolation-forest onnx library distributed-computing
Generated by gemma4:latest
Catalog Information
Create a library implementation for the Isolation Forest outlier detection algorithm. I need it to support both the standard Isolation Forest and an Extended Isolation Forest variant that uses random hyperplane splits. The core functionality should allow for distributed training and scoring using Scala and Spark data structures, ideally integrating with Spark's ML library for features like Pipelines. Additionally, please include support for saving and loading trained models, and the ability to e
Tags
scala spark machine-learning outlier-detection isolation-forest onnx library distributed-computing
Quality Score
B
82.1/100
Structure
82
Code Quality
80
Documentation
69
Testing
85
Practices
84
Security
100
Dependencies
80
Strengths
- CI/CD pipeline configured (github_actions)
- Good test coverage (86% test-to-source ratio)
- Code linting configured (ruff (possible))
- Good security practices — no major issues detected
- Properly licensed project
Weaknesses
- 321 duplicate lines detected — consider DRY refactoring
Languages
Frameworks
None detected
Symbols
method18
class1
function1
variable1
Embed Badge
Add to your README:
BinComp Dependency Hardening
All packages →2 of this repo's dependencies have been scanned for binary hardening. Grade reflects RELRO / stack canary / FORTIFY / PIE coverage.