Crypto Flexs
ADOPTION NEWS

Composio’s SWE agent achieves 48.6% on SWE-bench using LangGraph and LangSmith

By Crypto Flexs | November 11, 2024 | 3 Mins Read

Jack Anderson
November 11, 2024 18:08

Composio’s SWE agent, built with LangGraph and LangSmith, scored 48.6% on SWE-bench, a sign of progress in open source AI-based software engineering.





Composio’s SWE agent scored 48.6% on the SWE-bench benchmark, marking significant progress for open source software engineering agents. According to LangChainAI, the result highlights the agent’s ability to solve real-world software engineering problems using LangGraph and LangSmith.

Performance on SWE-bench

SWE-bench is a rigorous benchmark that evaluates coding agents on real-world tasks. It contains 2,294 GitHub issues from well-known Python libraries such as Django, SymPy, Flask, and scikit-learn. On a subset of 500 human-validated problems, the SWE agent solved 243 (48.6%), ranking fourth overall and second among open source entries.
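The headline score follows directly from the counts reported above. A minimal check in Python, where the function name `resolve_rate` is only an illustrative choice:

```python
def resolve_rate(solved: int, total: int) -> float:
    """Return the share of solved problems as a percentage."""
    return solved / total * 100


# Counts taken from the article: 243 solved out of 500 validated problems.
rate = resolve_rate(243, 500)
print(f"{rate:.1f}%")
```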

Innovative agent architecture

The SWE agent is built on LangGraph, which models each agent as a state machine. Rather than relying on ad hoc message passing between agents, it uses a state graph to manage agent interactions and to make otherwise hidden state explicit, which keeps the workflow stable and transparent.

Monitoring with LangSmith

Because agent behavior is non-deterministic, LangSmith is used to monitor it, providing comprehensive logging and a holistic view of agent operations. Combined with LangGraph, it gives detailed visibility into each step of the problem-solving process, which makes it easier to improve the agent’s tools.
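The idea of step-level visibility can be illustrated without any tracing service. The sketch below is not LangSmith’s API; it is a plain-Python stand-in in which the `traced` decorator, the `TRACE` list, and the two step functions are all hypothetical names chosen for illustration.

```python
import functools
import time
from typing import Any, Callable, Dict, List

# Hypothetical in-memory log; a real tracing service would store this remotely.
TRACE: List[Dict[str, Any]] = []


def traced(step_name: str) -> Callable:
    """Record the inputs, output, and duration of each call to the wrapped step."""
    def decorator(fn: Callable) -> Callable:
        @functools.wraps(fn)
        def wrapper(*args: Any, **kwargs: Any) -> Any:
            start = time.perf_counter()
            result = fn(*args, **kwargs)
            TRACE.append({
                "step": step_name,
                "args": args,
                "kwargs": kwargs,
                "result": result,
                "seconds": time.perf_counter() - start,
            })
            return result
        return wrapper
    return decorator


@traced("analyze")
def analyze(issue: str) -> str:
    return f"analysis of {issue}"


@traced("edit")
def edit(analysis: str) -> str:
    return f"patch based on {analysis}"


patch = edit(analyze("issue-1"))
for record in TRACE:
    print(record["step"], "->", record["result"])
```

Each record captures one step of the run, so a surprising final output can be traced back to the step that produced it.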

Specialized agents to improve performance

The SWE agent employs specialized agents, each with its own set of tools for specific tasks: a Software Engineering agent for task delegation, a CodeAnalyzer agent for codebase analysis, and an Editor agent for code exploration and modification. This specialization lets each agent focus on a well-defined task, improving overall performance.
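One way to picture this division of labor is a registry that maps each agent to the tools it may call. The sketch below is an assumption-laden illustration, not Composio’s code: the agent roles come from the article, while the `AgentSpec` class, the `can_use` helper, and every tool name are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet


@dataclass(frozen=True)
class AgentSpec:
    name: str
    role: str
    tools: FrozenSet[str]  # hypothetical tool names, for illustration only


AGENTS: Dict[str, AgentSpec] = {
    spec.name: spec
    for spec in (
        AgentSpec("software_engineer", "task delegation",
                  frozenset({"delegate_task"})),
        AgentSpec("code_analyzer", "codebase analysis",
                  frozenset({"search_code", "read_file"})),
        AgentSpec("editor", "code exploration and modification",
                  frozenset({"open_file", "edit_file"})),
    )
}


def can_use(agent_name: str, tool: str) -> bool:
    """Return True only if the tool belongs to the named agent's tool set."""
    return tool in AGENTS[agent_name].tools


print(can_use("editor", "edit_file"))
print(can_use("code_analyzer", "edit_file"))
```

Keeping the tool sets disjoint means a request for a tool outside an agent’s set can be rejected before it runs.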

State Management and Workflow

LangGraph’s architecture facilitates effective state management in multi-agent systems. Composio implements a state management system that avoids hidden-state pitfalls while maintaining clear boundaries and transitions. Router functions inspect markers in the agents’ messages to decide the next state transition, so each agent engages only in the tasks relevant to it.

The LangGraph workflow consists of three agent nodes and a tool node, each with predefined tasks and tools. This structured approach ensures clear task delegation and modularity, preventing duplication and unintended side effects.
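The routing described above can be sketched in plain Python. This is not LangGraph’s API and not Composio’s implementation; it only mimics the shape of three agent nodes, one tool node, and a router driven by message markers. The marker strings, the tool names, and the state fields are all assumptions made for the example.

```python
from typing import Callable, Dict, List, TypedDict

END = "END"


class State(TypedDict):
    messages: List[str]  # shared message history
    sender: str          # node that wrote the last message
    tool_caller: str     # agent that most recently requested a tool


def _emit(state: State, sender: str, message: str) -> State:
    """Return a new state with one message appended."""
    caller = sender if message.startswith("CALL TOOL") else state["tool_caller"]
    return {"messages": state["messages"] + [message],
            "sender": sender, "tool_caller": caller}


def software_engineer(state: State) -> State:
    # Delegates work: analysis first, then editing, then finish.
    history = state["messages"]
    if "ANALYSIS COMPLETED" not in history:
        return _emit(state, "software_engineer", "ANALYZE CODE")
    if "EDITING COMPLETED" not in history:
        return _emit(state, "software_engineer", "EDIT FILE")
    return _emit(state, "software_engineer", "PATCH COMPLETED")


def code_analyzer(state: State) -> State:
    if state["sender"] == "tools":  # tool result has arrived
        return _emit(state, "code_analyzer", "ANALYSIS COMPLETED")
    return _emit(state, "code_analyzer", "CALL TOOL search_code")


def editor(state: State) -> State:
    if state["sender"] == "tools":
        return _emit(state, "editor", "EDITING COMPLETED")
    return _emit(state, "editor", "CALL TOOL edit_file")


def tools(state: State) -> State:
    requested = state["messages"][-1].split()[-1]
    return _emit(state, "tools", f"TOOL RESULT {requested}: ok")


NODES: Dict[str, Callable[[State], State]] = {
    "software_engineer": software_engineer,
    "code_analyzer": code_analyzer,
    "editor": editor,
    "tools": tools,
}


def route(state: State) -> str:
    """Pick the next node from the marker in the last message."""
    last = state["messages"][-1]
    if last.startswith("CALL TOOL"):
        return "tools"
    if state["sender"] == "tools":
        return state["tool_caller"]  # hand the result back to the requester
    return {"ANALYZE CODE": "code_analyzer", "EDIT FILE": "editor",
            "ANALYSIS COMPLETED": "software_engineer",
            "EDITING COMPLETED": "software_engineer",
            "PATCH COMPLETED": END}[last]


def run(task: str, max_steps: int = 20):
    state: State = {"messages": [task], "sender": "user", "tool_caller": ""}
    node, visited = "software_engineer", []
    for _ in range(max_steps):
        if node == END:
            break
        visited.append(node)
        state = NODES[node](state)
        node = route(state)
    return visited, state


visited, final_state = run("Fix the reported issue")
print(" -> ".join(visited))
```

Because every transition is decided by `route` from the visible message history, there is no state that lives only inside an agent, which is the property the article attributes to the state-graph design.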

Empowering developers

The SWE-Kit platform offers a modular design that allows developers to create custom agents for specific workflows. This flexibility extends beyond software engineering to applications in CRM, HRM, and administrative tasks. Composio aims to help developers build intelligent agents that can transform workflows across a variety of industries.


