Trilogy AI COE LogoTrilogy AI Center of Excellence

Publications

Articles, papers, and blog posts from our research team.

Scientific Discourse for Builders
2025-09-19 | Education

How to read, question, and apply AI papers

Browsing, Rewired: My Dive into the AI Browser Frontier
2025-09-15 | Emerging Topics

First it was Dia, then came Comet. I downloaded Fellou.ai the other day, which bills itself as the first “agentic browser.” As I type this I’m also installing GenSpark’s new AI browser.

Nano Banana and the Rise of Conversational Creation
2025-09-01 | Emerging Topics

Why Gemini 2.5 Flash Image marks a permanent shift in creative workflows

Autonomous…ish: Why Two Newcomers Lapped Jules and Devin on Real Work
2025-08-27 | Agent Systems

Genspark & Abacus Ship, Jules & Devin Slip

The Six Pillars of Spec-Driven Work
2025-08-22 | Emerging Topics

Kiro and the orchestration of multi-tool pipelines for human–AI teams

Building the AI COE Chatbot
2025-08-19 | Agent Systems

Willfully over-engineering a simple RAG bot to explore agentic workflows

The One Rule That Made My AI Tutor 3× Cheaper (Without Losing Accuracy)
2025-08-14 | Evaluation

Cost‑Aware, Format‑Strict, and Surprisingly Minimal

Useful or Not: Declarative Self-improving Python
2025-08-13 | Evaluation

Quick Dive: An honest evaluation of where DSPy excels, what my implementation adds, and how you should (or shouldn't) use it

Reinforcement Learning For Agents - Part II
2025-08-11 | Agent Systems

A comparison of Agent Lightning, Handit.ai, and a Homegrown tool - AgentEvolve

Building an AI Coach for WorkSmart
2025-08-08 | Emerging Topics

Always-on AI coaching that keeps every employee focused, sane, and one step ahead.

Lights, Camera, Algorithm
2025-08-07 | Emerging Topics

Hands‑On with 2025’s AI Video Tools (and Why 8 Seconds Still Hurts)

Building Data Aggregation in Nexus Agents
2025-08-06 | Agent Systems

From Concept to Production with AI-Powered Development

Any Chatbot Can Become a Living Expert
2025-08-04 | Emerging Topics

The Simple Path from Text to Voice Avatar: Everyone can create a chatbot - I transformed any template-based chatbot into a visual, voice-enabled expert with complete control and scalability.

Reinforcement Learning Techniques to Optimize Agents
2025-08-01 | Agent Systems

Can RL loops continuously refine prompts, tools, and agentic pipelines?

From Precision to Scale: AI-Enabled Crawler
2025-07-28 | Emerging Topics

How combining existing tools and best practices helped me tackle the challenge of discovering and validating educational resources at scale

Qwen 3 Redefines Open‑Source AI Power
2025-07-27 | Emerging Topics

Meet the Three Musketeers of coding, reasoning, and instruction

Quantifying Expertise Inflation
2025-07-23 | Emerging Topics

From Satire to Scientific Measurement

Auto-Improve Bitcoin Algo Trading Strategies with LLMs
2025-07-22 | Emerging Topics

How to Build & Auto-Refine Algorithms Using Multi-Model LLM Loops

Useful or Not: DeepAgent
2025-07-17 | Evaluation

How enterprises can extract valuable technical patterns from DeepAgent's sophisticated design while demanding empirical validation

Clash of the Titans
2025-07-17 | Evaluation

Grok 4 vs. Kimi K2

Agentic Automation for Social Content
2025-07-15 | Agent Systems

Enterprise Content Orchestration for Content Creation, Approval and Scheduling with n8n & Airtable

Iterative AI System for Universal Discovery
2025-07-14 | Emerging Topics

10-engine system learning from each run; LLM ‘orchestrator’; open source APIs for 2,000+ vetted resources; AI-driven build-while-learning approach—enhanced with Google’s GenAI Processors architecture

AI Vision and the Future of UI Testing
2025-07-10 | Emerging Topics

A Hybrid Approach to Software Quality

AI Music Videos
2025-07-08 | Emerging Topics

A modern workflow

Analyzing Large Datasets with LLMs
2025-07-08 | Evaluation

How to Tame Context Limits, Retrieve Structured Data, and Build Reasoning Agents for Enterprise-Scale Insights

The Memory Framework Mirage: Data-Driven Reasons to Go Context-First
2025-07-04 | Emerging Topics

Opinion: From LangChain to Mem0, new benchmarks reveal million-token context windows plus a simple stack present a more compelling case than memory frameworks

The Hidden Cost of Scattered AI Tooling
2025-07-01 | Emerging Topics

And a Four-Layer Framework for Scalable Enterprise Adoption

Beyond Adoption: Defining Real AI Impact at Trilogy
2025-06-30 | Emerging Topics

Trilogy’s 73% AI usage is industry-leading — but business value trails. Here’s how we’ll turn high adoption into measurable impact, with standards, proven wins, and a culture of continuous learning

Payloads, Promises, and Protocols: The MCP/A2A Tightrope
2025-06-27 | Agent Systems

A hands-on breakdown of where MCP ends, where A2A begins, and why orchestration, not communication, is the real architectural battleground.

Behavioral Anti-Pattern Detection: A Comprehensive Technical Synthesis
2025-06-24 | Emerging Topics

Discover how AI-driven video analytics uncover, measure, and transform hidden workplace anti-patterns — translating rigorous research into actionable ideas for enterprise productivity and success

Claude Code: Triumphs, Trials & Trade-Offs
2025-06-24 | Evaluation

A deep dive into its architecture, standout features, and where it still falls short

The Multi-Agent Moment
2025-06-24 | Agent Systems

How a Fierce Debate Forged the Blueprint for the Next Generation of AI

Agent-to-Agent Communication: AI's Missing Link
2025-06-19 | Agent Systems

Why AI Agents Can't Talk to Each Other (And How A2A Aims to Fix It)

Agentic Retrieval Deepdive
2025-06-19 | Evaluation

From Off-the-Shelf to Custom: A Benchmarking Study of Agentic Retrieval Pipelines

AI Ping-Pong: Manual Multi-Model Workflow for 98% Content Quality
2025-06-18 | Emerging Topics

The era of single-model content creation is over. 20 minutes vs 120 minutes determines market leadership = 84% efficiency gain

Standardizing AI-to-System Integration
2025-06-15 | Agent Systems

Model Context Protocol

Retrieval Benchmarking: Agentic vs. Vanilla
2025-06-12 | Evaluation

Does agentic retrieval trump vanilla retrieval? What is the top performing combination of datastores and embeddings from a retrieval accuracy perspective

The Autonomous Developer
2025-06-12 | Agent Systems

A Guide to Tools, Trust, and Transparency in AI Coding

Validated 10-Minute AI-to-Slides Workflow
2025-06-12 | Multimodal AI

In todays market, 10 minutes vs 70 minutes determines who wins proposals. This 86% efficiency gain translates directly to competitive advantage worth $41,600 annual capacity per analyst.

Agentic Frameworks
2025-06-05 | Agent Systems

A comprehensive benchmark analysis of popular agentic frameworks including LangChain, LangGraph, CrewAI, and AutoGen, evaluating their performance in real-world scenarios and providing actionable insights for framework selection.

Text-to-Video Generation
2025-05-26 | Multimodal AI

From Theory to Practice with Automated Solutions

Navigating the Agent Framework Maze
2025-04-29 | Agent Systems

Analysis of Framework Architectures, Capabilities, and Multi-Agent Dynamics

Evaluating Agent Systems and Human AI Fluency (Part 2)
2025-04-25 | Evaluation

Assessing Human Readiness and Synergies in Human-AI Evaluation

Evaluating Agent Systems and Human AI Fluency (Part 1)
2025-04-22 | Evaluation

Benchmarking Multi-Agent Coordination, Reliability, and Interoperability

Google's A2A Protocol
2025-04-10 | Agent Systems

Enabling Seamless AI Agent Collaboration

Empowering Learners with AI Tutors
2025-04-07 | Education

The Future of Personalized and Self-Directed Learning

Generating Engaging Visuals for Education
2025-03-31 | Education

A Guide to AI-Powered Tools

Evaluating the Future of Agentic Automation
2025-03-24 | Emerging Topics

Beyond Manus AI

Enhancing LLM Evaluation with G-Eval
2025-03-16 | Evaluation

Creating Effective Datasets and Evaluation Criteria

Bridging AI Islands
2025-03-10 | Agent Systems

MCP Meets OVON in the Quest for True Interoperability

2025 February AI Round-Up
2025-02-28 | Emerging Topics

Key Highlights and Developments

Multi-Agent Deep Research Architecture
2025-02-26 | Agent Systems

Leveraging a Knowledge Base for Continuous, Iterative Discovery

Comparative Analysis of Deep Research Tools
2025-02-22 | Evaluation

Proprietary and Open-Source Solutions

LLM Evaluation Frameworks
2025-02-16 | Evaluation

Overview, Comparison, and Recommendation

Understanding GraphRAG: A Technical Deep Dive
2025-02-10 | Emerging Topics

Bridging Structured Knowledge and Generative AI for Smarter Solutions