sklearn

n8n nodes for scikit-learn machine learning algorithms

Package Information

Downloads: 0 weekly / 12 monthly

Latest Version: 0.6.0

Author: Arturo Vaine

Available Nodes

Sklearn Linear Regression

Perform linear regression using scikit-learn

Sklearn Logistic Regression

Perform logistic regression classification using scikit-learn

Sklearn Metrics

Calculate model evaluation metrics using scikit-learn

Sklearn Decision Tree

Decision Tree classifier and regressor using scikit-learn

Sklearn Random Forest

Random Forest classifier and regressor using scikit-learn

Sklearn Gradient Boosting

Gradient Boosting classifier and regressor using scikit-learn

Sklearn SVM

Support Vector Machine classifier and regressor using scikit-learn

Sklearn KNN

K-Nearest Neighbors classifier and regressor using scikit-learn

Sklearn Naive Bayes

Naive Bayes classifiers using scikit-learn

Sklearn KMeans

K-Means clustering using scikit-learn

Sklearn Standard Scaler

Standardize features using scikit-learn StandardScaler

Sklearn MinMax Scaler

Scale features to a given range using scikit-learn MinMaxScaler

Sklearn Label Encoder

Encode categorical labels as integers using scikit-learn

Sklearn One Hot Encoder

One-hot encode categorical features using scikit-learn

Sklearn PCA

Principal Component Analysis using scikit-learn

Sklearn Train Test Split

Split data into training and test sets using scikit-learn

Sklearn Datasets

Load sample datasets from scikit-learn

Sklearn Cross Validation

Perform cross-validation on sklearn models

Sklearn Grid Search CV

Perform hyperparameter tuning using Grid Search with Cross-Validation

Sklearn DBSCAN

DBSCAN clustering algorithm using scikit-learn

Sklearn Agglomerative Clustering

Agglomerative (hierarchical) clustering using scikit-learn

Sklearn Isolation Forest

Isolation Forest for anomaly detection using scikit-learn

Sklearn Simple Imputer

Handle missing values using scikit-learn SimpleImputer

Sklearn Polynomial Features

Generate polynomial and interaction features using scikit-learn

Sklearn Robust Scaler

Scale features using statistics robust to outliers (median and IQR)

Sklearn TF-IDF Vectorizer

Convert text to TF-IDF feature vectors using scikit-learn

Sklearn Feature Selection

Select best features using various sklearn methods

Sklearn Spectral Clustering

Graph-based spectral clustering using scikit-learn

Sklearn Mean Shift

Mean shift clustering to find dense regions using scikit-learn

Sklearn Truncated SVD

Dimensionality reduction using truncated SVD (LSA) for sparse data

Sklearn NMF

Non-negative Matrix Factorization for dimensionality reduction

Sklearn Normalizer

Normalize samples individually to unit norm using scikit-learn

Sklearn Binarizer

Binarize data (set feature values to 0 or 1) using scikit-learn

Sklearn Voting Classifier

Ensemble voting classifier combining multiple models

Sklearn Stacking Classifier

Stacked generalization ensemble classifier

Sklearn Calibrated Classifier

Probability calibration for classifiers using scikit-learn

Sklearn Elastic Net

Elastic Net regression (L1 + L2 regularization) using scikit-learn

Sklearn Ridge/Lasso

Ridge (L2) and Lasso (L1) regularized regression using scikit-learn

Sklearn MLP Neural Network

Multi-layer Perceptron neural network using scikit-learn

Sklearn Pipeline

Create and use sklearn pipelines chaining transformers and estimators

Documentation

n8n-nodes-sklearn

Custom n8n nodes for integrating scikit-learn machine learning algorithms into your n8n workflows.

Features

This package provides n8n nodes for popular scikit-learn functionality:

Sklearn Linear Regression: Train and predict using linear regression models
Sklearn Standard Scaler: Standardize features by removing the mean and scaling to unit variance

Installation

Prerequisites

Python 3.7+ with scikit-learn installed:
```
pip install scikit-learn numpy
```
n8n installed (version 0.190.0 or higher)

Installing the Package

Option 1: Install from npm (after publishing)

npm install n8n-nodes-sklearn

Option 2: Local Development Installation

Clone or download this repository
Navigate to the package directory:
```
cd n8n-nodes-sklearn
```
Install dependencies:
```
npm install
```
Build the package:
```
npm run build
```
Link the package to your n8n installation:
```
npm link
```
In your n8n custom nodes directory (usually ~/.n8n/custom):
```
npm link n8n-nodes-sklearn
```
Restart n8n to load the new nodes

Usage

Sklearn Linear Regression

Train Operation

Train a linear regression model using your input data.

Input Data Format:

[
  {
    "feature1": 1.0,
    "feature2": 2.0,
    "feature3": 3.0,
    "target": 10.5
  },
  {
    "feature1": 2.0,
    "feature2": 3.0,
    "feature3": 4.0,
    "target": 15.2
  }
]

Parameters:

Feature Columns: Comma-separated list of feature column names (e.g., feature1,feature2,feature3)
Target Column: Name of the target column (e.g., target)
Fit Intercept: Whether to calculate the intercept (default: true)
Python Path: Path to Python executable (default: python3)

Output:

{
  "model": "{...}",
  "coefficients": [1.2, 3.4, 2.1],
  "intercept": 0.5,
  "r2_score": 0.95,
  "feature_columns": ["feature1", "feature2", "feature3"],
  "training_samples": 100
}

Predict Operation

Make predictions using a trained model.

Parameters:

Model Data: JSON string containing the trained model (from train operation)
Feature Columns: Comma-separated list of feature columns (must match training)
Python Path: Path to Python executable

Output:
The original input data with an added prediction field.

Sklearn Standard Scaler

Fit Transform Operation

Fit the scaler to your data and transform it in one step.

Input Data Format:

[
  {
    "age": 25,
    "income": 50000,
    "score": 85
  },
  {
    "age": 35,
    "income": 75000,
    "score": 92
  }
]

Parameters:

Feature Columns: Comma-separated list of columns to scale (e.g., age,income,score)
With Mean: Whether to center the data (default: true)
With Std: Whether to scale to unit variance (default: true)
Output Prefix: Prefix for scaled columns (default: scaled_)
Python Path: Path to Python executable

Output:
Original data with scaled features added:

{
  "age": 25,
  "income": 50000,
  "score": 85,
  "scaled_age": -0.707,
  "scaled_income": -0.707,
  "scaled_score": -0.707,
  "scaler": "{...}",
  "scaler_info": {
    "mean": [30, 62500, 88.5],
    "scale": [7.071, 17677.67, 4.95]
  }
}

Fit Operation

Fit the scaler and save parameters for later use.

Output:

{
  "scaler": "{...}",
  "mean": [30, 62500, 88.5],
  "scale": [7.071, 17677.67, 4.95],
  "variance": [50, 312500000, 24.5],
  "feature_columns": ["age", "income", "score"],
  "fitted_samples": 100
}

Transform Operation

Transform data using a previously fitted scaler.

Parameters:

Scaler Data: JSON string from fit operation
Feature Columns: Comma-separated list of columns (must match fitted columns)
Output Prefix: Prefix for scaled columns

Example Workflows

Example 1: Simple Linear Regression Pipeline

Read CSV Node: Load your training data
Sklearn Linear Regression (Train): Train the model
Set Node: Store the model in a variable
Read CSV Node: Load new data for prediction
Sklearn Linear Regression (Predict): Make predictions

Example 2: Preprocessing Pipeline

HTTP Request Node: Fetch data from API
Sklearn Standard Scaler (Fit Transform): Normalize features
Sklearn Linear Regression (Train): Train on normalized data
Code Node: Evaluate model performance

Configuration

Python Path

By default, nodes use python3 command. If you need to specify a different Python executable:

Set the Python Path parameter in each node
Or set an environment variable before starting n8n:
```
export PYTHON_PATH=/path/to/python3
n8n start
```

Troubleshooting

Error: Python script failed

Ensure Python 3.7+ is installed
Verify scikit-learn is installed: python3 -c "import sklearn; print(sklearn.__version__)"
Check Python path in node parameters

Error: Feature column not found

Verify column names match your input data exactly (case-sensitive)
Check for extra spaces in column names

Memory issues with large datasets

Consider processing data in batches
Use n8n's batch processing features
Increase Node.js memory limit: NODE_OPTIONS=--max-old-space-size=4096 n8n start

Development

Building

npm run build

Linting

npm run lint
npm run lintfix  # Auto-fix issues

Adding New Nodes

Create a new directory in nodes/
Create YourNode.node.ts implementing INodeType
Add the node to package.json under n8n.nodes
Run npm run build

Roadmap

Future nodes planned:

Logistic Regression
Decision Trees / Random Forest
K-Means Clustering
Principal Component Analysis (PCA)
Support Vector Machines (SVM)
Model evaluation metrics
Cross-validation utilities

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

MIT

Support

For issues and questions:

GitHub Issues: https://github.com/arturovaine/n8n-nodes-sklearn/issues
n8n Community Forum: https://community.n8n.io/

Acknowledgments

Built on top of the excellent n8n workflow automation platform
Uses scikit-learn for machine learning functionality

sklearnInstall

Package Information

Available Nodes

Documentation

n8n-nodes-sklearn

Features

Installation

Prerequisites

Installing the Package

Option 1: Install from npm (after publishing)

Option 2: Local Development Installation

Usage

Sklearn Linear Regression

Train Operation

Predict Operation

Sklearn Standard Scaler

Fit Transform Operation

Fit Operation

Transform Operation

Example Workflows

Example 1: Simple Linear Regression Pipeline

Example 2: Preprocessing Pipeline

Configuration

Python Path

Troubleshooting

Development

Building

Linting

Adding New Nodes

Roadmap

Contributing

License

Support

Acknowledgments

Discussion

sklearn