Evaluate, observe, and gate LangGraph and CrewAI agents before they reach production. Built by a Senior AI Engineer at PayPal.
from cortexops import CortexTracer, EvalSuite tracer = CortexTracer(project="payments-agent") # key auto-loaded agent = tracer.wrap(your_langgraph_app) # zero refactoring results = EvalSuite.run( dataset="golden_v1.yaml", agent=agent, fail_on="task_completion < 0.90", # CI gate ) print(results.summary())
Click any trace row to see the node waterfall — exactly which step took how long, which tools were called, and what the output was. Debug a 2am incident in 30 seconds.
Four steps. No refactoring. Works with any LangGraph or CrewAI agent.
LangSmith charges $39/seat plus $2.50–$5.00 per 1,000 traces. At 50k traces/month that's $164 per seat. CortexOps is $49/seat. Flat.
No credit card required for free tier. Pro starts with a cancel anytime.
18 pages covering installation, golden datasets, CI gate, LangGraph, CrewAI, API reference, and more. No GitHub redirect.
Install the SDK and run your first eval in under 2 minutes.
# Install pip install cortexops from cortexops import CortexTracer, EvalSuite tracer = CortexTracer(project="my-agent") agent = tracer.wrap(my_agent) results = EvalSuite.run( dataset="golden_v1.yaml", agent=agent, verbose=True, )