
Getting Started

Installation

Prerequisites

  • Ubuntu 22.04+ (primary development/testing platform)
  • Vivado Design Suite 2024.2 (migration to 2025.1 in progress)
  • [Optional] CMake for BERT example V80 shell integration
Clone the repository

git clone https://github.com/microsoft/brainsmith.git ./brainsmith
cd brainsmith

Option A: Local Development with Poetry

Prerequisites

  • Python 3.11+ and Poetry
  • [Optional] direnv for automatic environment activation

Run automated setup script

./setup-venv.sh

Edit the project configuration file to customize project settings

vim brainsmith.yaml

Activate environment manually or with direnv

source .venv/bin/activate && source .brainsmith/env.sh
# If using direnv, just reload directory
cd .

Query project settings to confirm your configuration is loaded correctly

brainsmith project info

Option B: Docker-based Development

Prerequisites

  • Docker installed and runnable by your user

Edit ctl-docker.sh, or set environment variables, to set Brainsmith project settings directly:

export BSMITH_XILINX_PATH=/tools/Xilinx/
export BSMITH_XILINX_VERSION=2024.2
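Before starting the container, you can sanity-check these settings with a short shell snippet (an illustrative check that only verifies the install path exists; it does not inspect the container):

```shell
# Fall back to the documented defaults if the variables are unset.
: "${BSMITH_XILINX_PATH:=/tools/Xilinx/}"
: "${BSMITH_XILINX_VERSION:=2024.2}"

# Warn early if the Xilinx install directory is missing.
if [ -d "${BSMITH_XILINX_PATH}" ]; then
    echo "Xilinx tools found at ${BSMITH_XILINX_PATH} (version ${BSMITH_XILINX_VERSION})"
else
    echo "WARNING: ${BSMITH_XILINX_PATH} does not exist; check BSMITH_XILINX_PATH" >&2
fi
```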

Start container

./ctl-docker.sh start

Open an interactive shell to check your configuration

./ctl-docker.sh shell
brainsmith project info

Or send one-off commands to the container

./ctl-docker.sh "brainsmith project info"

Command-Line Interface

Brainsmith provides two CLIs:

smith - Streamlined CLI for creating dataflow accelerators:

smith model.onnx blueprint.yaml    # Create dataflow accelerator

brainsmith - Full toolkit with administrative commands:

brainsmith project init                # Initialize project
brainsmith registry                    # List registered components
brainsmith setup cppsim                # Setup C++ simulation

All commands support --help for details. See CLI Reference for complete documentation.

Project Management

Brainsmith operates from a single Poetry venv at the repository root, but you can create isolated workspaces via the project system, each with its own configuration, build directory, and component registry.

Creating Projects

# Activate brainsmith venv if not in an active project
source /path/to/brainsmith/.venv/bin/activate

# Create and initialize new project directory
brainsmith project init ~/my-fpga-project
cd ~/my-fpga-project

Edit the default generated config file

vim ~/my-fpga-project/brainsmith.yaml

[Optional] Enable auto-activation if using direnv

brainsmith project allow-direnv
cd .  # Triggers direnv

Otherwise, refresh env after any config changes:

source .brainsmith/env.sh

Configuration

Project settings are loaded from multiple sources with the following priority (highest to lowest). This layering allows fine-grained control, but the recommended workflow is to keep settings in the YAML config file and override with CLI arguments as needed.

  1. CLI arguments - Passed to load_config() or command-line tools
  2. Environment variables - BSMITH_* prefix (e.g., BSMITH_BUILD_DIR)
  3. Project config file - brainsmith.yaml in project root
  4. Built-in defaults - Field defaults in SystemConfig
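The lookup order can be illustrated with plain shell parameter expansion (a toy model of the precedence for a single field, not the actual load_config() implementation):

```shell
# Simulated sources for one field, build_dir, highest priority first:
#   CLI argument > BSMITH_BUILD_DIR env var > brainsmith.yaml > built-in default.
cli_build_dir=""                       # no CLI override in this run
yaml_build_dir="build"                 # value from brainsmith.yaml
default_build_dir="build"              # SystemConfig field default

export BSMITH_BUILD_DIR="/tmp/experiment"

# The first non-empty layer wins, searched from highest to lowest priority.
effective="${cli_build_dir:-${BSMITH_BUILD_DIR:-${yaml_build_dir:-$default_build_dir}}}"
echo "build_dir = ${effective}"        # the env var wins here
```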

Relative Path Resolution

  • CLI args: Resolve from current working directory
  • YAML/env: Resolve from project root

Brainsmith Settings

  • build_dir (Path, default "build"): Build directory for compilation artifacts. Relative paths resolve to project_dir.
  • component_sources (Dict[str, Path | None], default {'project': None}): Filesystem-based component source paths. 'project' defaults to project_dir (supports kernels/ and steps/ subdirectories). The core namespace ('brainsmith') and entry points ('finn') are loaded automatically.
  • source_priority (List[str], default ['project', 'brainsmith', 'finn', 'custom']): Component source resolution priority (first match wins). Custom sources are auto-appended if not listed.
  • source_module_prefixes (Dict[str, str], default {'brainsmith.': 'brainsmith', 'finn.': 'finn'}): Module prefix → source name mapping for component classification.
  • components_strict (bool, default True): Enable strict component loading (fail on errors rather than warn). Set to false for development.
  • cache_components (bool, default True): Enable manifest caching for component discovery.
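As a concrete sketch, a brainsmith.yaml using these fields might look like the following (values are illustrative, not recommendations):

```yaml
# brainsmith.yaml (illustrative values)
build_dir: build/experiments          # resolves relative to project_dir
component_sources:
  project: null                       # defaults to project_dir (kernels/, steps/)
  custom: ./my_components             # extra filesystem source (hypothetical path)
source_priority: [project, brainsmith, finn, custom]
components_strict: false              # warn instead of fail during development
cache_components: true
```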

External Tool Settings

  • netron_port (int, default 8080): Port for the Netron neural network visualization server.
  • xilinx_path (Path, default "/tools/Xilinx"): Xilinx root installation path.
  • xilinx_version (str, default "2024.2"): Xilinx tool version (e.g., "2024.2", "2025.1").
  • vivado_ip_cache (Path | None, default {build_dir}/vivado_ip_cache): Vivado IP cache directory for faster builds.
  • vendor_platform_paths (str, default "/opt/xilinx/platforms"): Colon-separated vendor platform repository paths.

Runtime Configuration

  • default_workers (int, default 4): Default number of workers for parallel operations. Exported as NUM_DEFAULT_WORKERS.
  • logging.level (str, default "normal"): Console verbosity: quiet | normal | verbose | debug.
  • logging.finn_tools (Dict[str, str], default None): Per-tool log levels for FINN tools (e.g., {'vivado': 'WARNING', 'hls': 'INFO'}).
  • logging.suppress_patterns (List[str], default None): Regex patterns to suppress from console output (file logs are unaffected).
  • logging.max_log_size_mb (int, default 0): Maximum log file size in MB (0 = no rotation).
  • logging.keep_backups (int, default 3): Number of rotated log backups to keep.
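The dotted logging.* names correspond to a nested block in brainsmith.yaml; a sketch with illustrative values:

```yaml
default_workers: 8                    # exported as NUM_DEFAULT_WORKERS
logging:
  level: verbose                      # quiet | normal | verbose | debug
  finn_tools:
    vivado: WARNING
    hls: INFO
  suppress_patterns:
    - "Deprecated.*ignored"           # regex; hidden from console, kept in file logs
  max_log_size_mb: 64                 # rotate file logs past 64 MB
  keep_backups: 3
```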

Additional configuration fields (FINN settings, direct Xilinx tool paths, etc.) can be set directly, but it is recommended to let them auto-configure from the core Brainsmith fields.

Running Design Space Exploration

Prerequisites

Make sure you've completed the installation above and activated your environment:

# Option 1: direnv users
cd /path/to/brainsmith

# Option 2: Manual activation
source .venv/bin/activate && source .brainsmith/env.sh

Run Your First DSE

1. Navigate to the BERT Example

cd examples/bert

2. Run the Quick Test

./quicktest.sh

This automated script will:

  1. Generate a quantized ONNX model - Single-layer BERT for rapid testing
  2. Generate a folding configuration - Minimal resource usage
  3. Build the dataflow accelerator - RTL generation with FINN
  4. Run RTL simulation - Verify correctness

Build Time

The quicktest takes approximately 30-60 minutes, depending on your system. The build process involves:

  • ONNX model transformations
  • Kernel inference and specialization
  • RTL code generation
  • IP packaging with Vivado
  • RTL simulation

3. Monitor Progress

The script outputs detailed progress to the console and log files. The build process transforms your model through several stages:

Transformation Pipeline:

PyTorch → ONNX → Hardware Kernels → HLS/RTL → IP Cores → Bitfile

Key stages you'll see:

  • Model transformation: Converting ONNX operations to hardware kernels
  • Design space exploration: Determining parallelization factors (PE/SIMD)
  • Code generation: Generating HLS C++ and RTL (Verilog/VHDL)
  • IP packaging: Creating Vivado IP cores
  • Simulation: Verifying correctness with RTL simulation

Check build/quicktest/brainsmith.log for detailed progress and diagnostics.

Explore Results

Results are saved in examples/bert/quicktest/:

quicktest/
├── model.onnx              # Quantized ONNX model
├── final_output/           # Generated RTL and reports
│   ├── stitched_ip/       # Synthesizable RTL
│   │   ├── finn_design_wrapper.v
│   │   └── *.v
│   └── report/            # Performance estimates
│       └── estimate_reports.json
└── build_dataflow.log     # Detailed build log

Understanding the Output

Performance Report

Check final_output/report/estimate_reports.json:

{
  "critical_path_cycles": 123,
  "max_cycles": 456,
  "estimated_throughput_fps": 1234.5,
  "resources": {
    "LUT": 12345,
    "FF": 23456,
    "BRAM_18K": 34,
    "DSP48": 56
  }
}
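To pull the headline numbers out of a report like this without extra tools, you can use the Python interpreter already in the environment. The snippet below parses an inline copy of the sample report; in a real build you would point it at final_output/report/estimate_reports.json instead (exact report contents may vary by build):

```shell
# Write the sample report to a temp file, then extract two fields from it.
report=$(mktemp)
cat > "$report" <<'EOF'
{"estimated_throughput_fps": 1234.5,
 "resources": {"LUT": 12345, "FF": 23456, "BRAM_18K": 34, "DSP48": 56}}
EOF

# Parse the JSON and print the throughput and LUT count.
python3 - "$report" <<'PY'
import json, sys
r = json.load(open(sys.argv[1]))
print("throughput_fps:", r["estimated_throughput_fps"])
print("LUT:", r["resources"]["LUT"])
PY
```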

RTL Output

The generated RTL is in final_output/stitched_ip/:

  • finn_design_wrapper.v — Top-level wrapper with AXI stream interfaces
  • Individual kernel implementations (e.g., MVAU_hls_*.v, Thresholding_rtl_*.v)
  • Stream infrastructure (FIFOs, width converters, etc.)

Customize the Design

Now that you've run the basic example, try customizing it:

Adjust Target Performance

Edit bert_quicktest.yaml to increase throughput:

finn_config:
  target_fps: 10  # Increase from 1 to 10 FPS

Higher target FPS will:

  • Increase parallelization factors (PE/SIMD parameters)
  • Use more FPGA resources (LUTs, DSPs, BRAM)
  • Reduce latency per inference

Create Custom Configurations

Create your own blueprint that inherits from the BERT base:

# my_custom_bert.yaml
name: "My Custom BERT"
extends: "${BSMITH_DIR}/examples/blueprints/bert.yaml"

clock_ns: 4.0           # 250MHz (faster clock)
output: "estimates"     # Just get resource estimates

finn_config:
  target_fps: 5000      # Very high throughput

Then run it:

python bert_demo.py -o my_output -l 2 --blueprint my_custom_bert.yaml

Run Full BERT Demo

The full demo processes larger models. From examples/bert/:

# Generate standard folding configuration
python gen_folding_config.py --simd 16 --pe 16 -o configs/demo_folding.json

# Run with 4 layers instead of 1
python bert_demo.py -o bert_demo_output -l 4 --blueprint bert_demo.yaml

This creates a more realistic accelerator but takes significantly longer to build.

Understanding Blueprints

A Blueprint is a YAML configuration that defines your hardware design space - the kernels, transformation steps, and build parameters.

Blueprint Inheritance

Blueprints support inheritance via the extends key, allowing you to build on existing configurations:

# bert_quicktest.yaml - Quick test configuration
name: "BERT Quicktest"
extends: "${BSMITH_DIR}/examples/bert/bert_demo.yaml"

output: "bitfile"

finn_config:
  target_fps: 1                     # Low FPS for quick testing
  folding_config_file: "configs/quicktest_folding.json"
  fifosim_n_inferences: 2           # Faster FIFO sizing

Getting Help

Build logs: Check <output_dir>/brainsmith.log for detailed error messages and transformation steps.

Resources: