bedrock-advanced

n8n community node for AWS Bedrock Chat Model with prompt caching support

Package Information

Downloads: 0 weekly / 0 monthly

Latest Version: 0.3.0

Author: Amir Souchami

Available Nodes

AWS Bedrock Chat Model (Advanced)

AWS Bedrock Language Model with prompt caching support

Documentation

n8n-nodes-bedrock-advanced

An advanced version of the official n8n AWS Bedrock Chat Model node, adding prompt caching and detailed token usage metrics including cache hit/miss reporting.

Why this node?

The official n8n Bedrock node does not support AWS Bedrock's prompt caching feature. When building AI agents with long system prompts or repeated tool definitions, prompt caching can significantly reduce both latency and cost by reusing previously processed tokens across requests.

This node is a drop-in replacement that adds:

Prompt caching with configurable TTL
Token usage metrics including cache read/write breakdowns visible in n8n's execution data

Features

Prompt Caching

Enable prompt caching to automatically inject cache points into system messages before they reach the Bedrock Converse API. Bedrock will cache the system prompt prefix and reuse it across subsequent requests within the TTL window.

Supported TTL options:

TTL	Supported Models
5 minutes	All cacheable models (Claude 3.5 Sonnet v2, 3.7 Sonnet, Amazon Nova, etc.)
1 hour	Claude 4.5 Opus, Sonnet 4.5, Haiku 4.5

Token Usage Metrics

Every execution reports detailed token usage in the node's output data:

{
  "tokenUsage": {
    "inputTokens": 150,
    "outputTokens": 500,
    "totalTokens": 650,
    "cacheReadInputTokens": 120,
    "cacheWriteInputTokens": 0
  }
}

Field	Description
`inputTokens`	Total input tokens processed
`outputTokens`	Tokens generated in the response
`totalTokens`	Combined input + output tokens
`cacheReadInputTokens`	Tokens served from cache (cache hit)
`cacheWriteInputTokens`	Tokens written to cache (cache miss / first call)

Metrics work for both streaming and non-streaming invocations.

All Original Features

Everything from the official node is preserved:

On-demand models and inference profiles
Dynamic model listing from your AWS account
Temperature and max tokens configuration
AWS credential support (access key, secret key, session token)

Installation

In n8n (community node)

Go to Settings > Community Nodes
Enter n8n-nodes-bedrock-advanced
Click Install

Manual installation

cd ~/.n8n
npm install n8n-nodes-bedrock-advanced

Then restart n8n.

Configuration

Add the AWS Bedrock Chat Model (Advanced) node to your workflow
Select your AWS credentials
Choose your model (on-demand or inference profile)
Under Options:
- Toggle Prompt Caching to enable
- Select Cache TTL (5 minutes or 1 hour)
- Adjust temperature and max tokens as needed

How Prompt Caching Works

When enabled, the node automatically appends a cachePoint marker after the system message content before sending the request to Bedrock. On the first call, Bedrock caches the system prompt tokens (cacheWriteInputTokens > 0). On subsequent calls within the TTL window, Bedrock reuses the cached tokens (cacheReadInputTokens > 0), reducing processing time and cost.

Requirements:

The cached content must remain identical across requests for cache hits
Minimum token thresholds apply per model (1,024 for Sonnet/Nova, 4,096 for Opus/Haiku)
Maximum 4 cache checkpoints per request (this node uses 1 for the system prompt)

Development

# Install dependencies
npm install

# Build
npm run build

# Watch mode
npm run dev

License

MIT

bedrock-advancedInstall