AIfred Intelligence

Example Conversations & Showcases

Multi-Agent Debates

🏆 Showcase: Dog or Cat (Hund oder Katze) – Philosophical Debate

Why this matters: Research shows that multi-agent debate systems struggle with "rubber-stamping" (critics that simply agree), echo chambers, and information loss during synthesis. This debate shows AIfred avoiding all three failure modes while running on a local 30B model.

A trivial question evolves through four categorical phases: Character typology → Virtue ethics → Relationship theory → Meta-ethics of equality. Sokrates delivers real critique, and Salomo synthesizes without information loss.

Auto-Consensus · 3 Agents · 2 Rounds · Qwen3:30B · Local
Konsens (DE) 🔊 · Consensus (EN) 🔊 · Tribunal (DE) · Tribunal (EN) · Konsens-Analyse (DE) · Consensus Analysis (EN) · Tribunal-Analyse (DE) · Tribunal Analysis (EN)

⚖️ Showcase: Tribunal Mode – Error Handling Debate

Tribunal vs Auto-Consensus: In Tribunal mode, Sokrates acts as prosecutor (not coach). AIfred must DEFEND or REVISE – there's no [LGTM] voting. This A/B comparison shows how personality prompts affect the adversarial debate dynamic.

The question "Should every error be logged?" triggers a structured debate where AIfred defends his position against Sokrates' attacks, and Salomo delivers a final verdict. Compare WITH vs WITHOUT personality prompts to see the stylistic differences.

Tribunal Mode · 3 Agents · A/B Comparison · Qwen3:30B · Local
Full Debate WITH (DE) · Full Debate WITH (EN) · Full Debate WITHOUT (DE) · Full Debate WITHOUT (EN) · Analysis (DE) · Analysis (EN)

🔬 A/B Test: Code Review – WITH vs WITHOUT Personalities

Does personality affect quality? This side-by-side comparison shows the same "Should I split this Python function?" question answered with and without AIfred's Butler personality. Both use Auto-Consensus mode with [LGTM]/[WEITER] voting.
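As a rough illustration of tag-based voting, the sketch below checks whether a round has reached consensus. It is not AIfred's actual implementation. The function names are hypothetical, and three rules are assumptions made for the example: the last tag in a reply counts, a reply without a tag means "continue", and consensus requires every reply to vote `[LGTM]`.

```python
import re
from typing import List

# Matches the two vote tags mentioned above. "WEITER" is German for "continue".
VOTE_PATTERN = re.compile(r"\[(LGTM|WEITER)\]")


def extract_vote(reply: str) -> str:
    """Return 'LGTM' or 'WEITER' for one agent reply.

    Assumption: the last tag in the reply wins, and a reply
    without any tag is treated as a request to continue.
    """
    matches = VOTE_PATTERN.findall(reply)
    return matches[-1] if matches else "WEITER"


def consensus_reached(replies: List[str]) -> bool:
    """Assumption: consensus means every reviewing agent voted [LGTM]."""
    return bool(replies) and all(extract_vote(r) == "LGTM" for r in replies)


if __name__ == "__main__":
    round_one = ["The function does too much. [WEITER]", "Agreed in part. [LGTM]"]
    round_two = ["The split is clean now. [LGTM]", "Nothing to add. [LGTM]"]
    print(consensus_reached(round_one))
    print(consensus_reached(round_two))
```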

Result: Personality prompts add stylistic flair (British expressions, philosophical references) but the core technical analysis remains equivalent. This validates the 3-layer prompt architecture: Identity (who) + Personality (how, optional) + Task (what).
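The three layers can be pictured as separately stored strings joined into one system prompt, with the personality layer simply omitted for the WITHOUT runs. The sketch below is a minimal illustration under that assumption. `PromptLayers`, `build_system_prompt`, and the sample prompt texts are hypothetical and are not taken from AIfred's code.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PromptLayers:
    """Hypothetical container for the three layers named above."""
    identity: str                      # who the agent is
    task: str                          # what the agent should do
    personality: Optional[str] = None  # how it speaks (optional layer)


def build_system_prompt(layers: PromptLayers) -> str:
    """Join the layers in the order Identity, Personality, Task.

    The personality layer is skipped when absent, so the identity
    and task text are identical in both arms of an A/B test.
    """
    parts = [layers.identity]
    if layers.personality:
        parts.append(layers.personality)
    parts.append(layers.task)
    return "\n\n".join(parts)


if __name__ == "__main__":
    with_personality = PromptLayers(
        identity="You are AIfred, the lead assistant.",
        personality="Speak like a courteous British butler.",
        task="Decide whether the given Python function should be split.",
    )
    without_personality = PromptLayers(
        identity=with_personality.identity,
        task=with_personality.task,
    )
    print(build_system_prompt(with_personality))
    print("---")
    print(build_system_prompt(without_personality))
```

Keeping the layers separate means the only variable between the two runs is the personality text.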

Auto-Consensus · A/B Testing · Personality Study · Qwen3:30B · Local
WITH Personality (DE) · WITH Personality (EN) · WITHOUT Personality (DE) · WITHOUT Personality (EN)

Science & Math

Chemistry: Balancing Combustion Equations

AIfred explains how to balance the combustion of ethanol step by step, with proper chemical notation rendered via mhchem. Features a coefficient table and a verification of the law of conservation of mass.
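For readers who want to check the balance themselves, the sketch below counts atoms on both sides of the standard complete-combustion equation for ethanol, written in mhchem as `\ce{C2H5OH + 3 O2 -> 2 CO2 + 3 H2O}`. It is an independent illustration, not the content of the chat. The helper name `atom_totals` and the dictionary layout are choices made for this example.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Atom counts per molecule (element symbol -> number of atoms).
ETHANOL = {"C": 2, "H": 6, "O": 1}   # C2H5OH
OXYGEN = {"O": 2}                    # O2
CARBON_DIOXIDE = {"C": 1, "O": 2}    # CO2
WATER = {"H": 2, "O": 1}             # H2O

Side = List[Tuple[int, Dict[str, int]]]


def atom_totals(side: Side) -> Counter:
    """Sum atoms of each element over one side of an equation.

    Each entry is (stoichiometric coefficient, molecule atom counts).
    """
    totals: Counter = Counter()
    for coefficient, molecule in side:
        for element, count in molecule.items():
            totals[element] += coefficient * count
    return totals


# C2H5OH + 3 O2 -> 2 CO2 + 3 H2O
reactants: Side = [(1, ETHANOL), (3, OXYGEN)]
products: Side = [(2, CARBON_DIOXIDE), (3, WATER)]

if __name__ == "__main__":
    print("reactants:", dict(atom_totals(reactants)))
    print("products: ", dict(atom_totals(products)))
    print("balanced: ", atom_totals(reactants) == atom_totals(products))
```

Equal atom counts on both sides imply equal total mass, which is the conservation check the chat walks through.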

Standard Mode · mhchem · gpt-oss:120b
View Full Chat

Physics: Schrödinger Equation for a Victorian Gentleman

"Explain the Schrödinger equation as if I'm a Victorian gentleman": AIfred rises to the challenge with historical context, elegant LaTeX formulas, and analogies to a gentleman's drawing-room. A masterclass in making quantum mechanics accessible.
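For reference, this is the equation in question in the kind of LaTeX that KaTeX renders. It is the standard textbook time-dependent form for a single particle of mass m in a potential V, not a transcript of AIfred's answer, which may use a different form or notation.

```latex
i\hbar \frac{\partial}{\partial t} \Psi(\mathbf{r}, t)
  = \left[ -\frac{\hbar^{2}}{2m} \nabla^{2} + V(\mathbf{r}, t) \right] \Psi(\mathbf{r}, t)
```

Here Ψ is the wave function, ħ is the reduced Planck constant, and the bracketed operator is the Hamiltonian (kinetic plus potential energy).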

Standard Mode · KaTeX · gpt-oss:120b
View Full Chat

Coding

Python: Prime Number Calculator

A refined implementation of the Sieve of Eratosthenes algorithm, complete with type hints, docstrings, and Butler-style code comments. Shows AIfred's ability to write clean, well-documented code while maintaining his characteristic charm.
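For orientation, a plain version of the algorithm is sketched below. This is not the code from the chat (it lacks the Butler-style comments, for one). The function name `primes_up_to` is chosen for this example.

```python
from typing import List


def primes_up_to(limit: int) -> List[int]:
    """Return all primes less than or equal to `limit`.

    Uses the Sieve of Eratosthenes: start with every number marked as
    prime, then cross out the multiples of each prime in turn.
    """
    if limit < 2:
        return []

    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False

    candidate = 2
    # Any composite number <= limit has a prime factor <= sqrt(limit).
    while candidate * candidate <= limit:
        if is_prime[candidate]:
            # Smaller multiples were already crossed out by smaller primes,
            # so crossing out can start at candidate squared.
            for multiple in range(candidate * candidate, limit + 1, candidate):
                is_prime[multiple] = False
        candidate += 1

    return [number for number, prime in enumerate(is_prime) if prime]


if __name__ == "__main__":
    print(primes_up_to(30))
```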

Standard Mode · Python · gpt-oss:120b
View Full Chat

Performance & Benchmarks

Tensor Split Benchmark: Speed vs. Full Context

Does aggressive GPU placement matter? When running a 46.6 GB model across two unequal GPUs (RTX 8000 + Tesla P40), the tensor split ratio determines how much computation happens on the fast vs. slow GPU. This benchmark compares a balanced 2:1 split (full 262K context) against an aggressive 11:1 split (32K context, 92% on the fast GPU).

Measured in a real 6-turn AIfred tribunal debate ("Is water wet?") with 3 agents across 2 rounds. Results: generation is 10–15% faster in Round 1, shrinking to ~4% in Round 2, while prompt processing is ~2% slower with the aggressive split. Total wall-clock time: about 10 seconds saved (113s vs 124s). Zero quality difference.
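The arithmetic behind the "92% on the fast GPU" figure can be sketched as follows. It assumes the model weights are divided strictly in proportion to the split ratio and ignores KV cache, context buffers, and other overhead, so the gigabyte figures are rough estimates rather than measured allocations. The function name `split_shares` is hypothetical.

```python
from typing import List, Sequence


def split_shares(ratio: Sequence[float], model_gb: float) -> List[float]:
    """Divide a model's size across GPUs in proportion to a split ratio.

    Assumption: weights are distributed exactly proportionally;
    KV cache and runtime buffers are not modelled.
    """
    total = sum(ratio)
    return [model_gb * part / total for part in ratio]


if __name__ == "__main__":
    model_gb = 46.6  # model size quoted above

    for label, ratio in (("balanced 2:1", (2, 1)), ("aggressive 11:1", (11, 1))):
        fast_gb, slow_gb = split_shares(ratio, model_gb)
        fast_pct = 100 * fast_gb / model_gb
        print(f"{label}: fast GPU {fast_gb:.1f} GB ({fast_pct:.0f}%), "
              f"slow GPU {slow_gb:.1f} GB")
```

With 11:1, the fast GPU takes 11/12 of the weights, which rounds to the 92% quoted above. With 2:1 it takes two thirds.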

Benchmark · Multi-GPU · Tensor Split · Qwen3-Next:80B · Local
View Full Benchmark · Raw Data (Markdown)

Web Research

Medical Research: Spinal Anesthesia Guidelines

A complex medical query about spinal anesthesia in patients with myasthenia gravis. AIfred automatically searches medical literature, synthesizes findings from multiple sources, and provides a cautious, well-referenced answer with proper citations.

Research Mode · 4 Sources · qwen3:14b
View Full Chat