Langsmith enhances LLM assessment with Pytest and Vitest integration

Caroline Bishop
January 25, 2025 04:44

To enhance the assessment of LLM applications, Langsmith has introduced Pytest and Vitest integrations, providing developers with an improved testing framework.

Langsmith aims to streamline the evaluation process for Languages Model (LLM) applications by unveiling new integrations with Pytest and Vitest. According to Langchain’s blog, this integration is currently in beta with version 0.3.0 of the Langsmith Python and TypeScript SDK, providing improved testing capabilities for developers.

Enhanced Testing Framework for LLM Assessment

The LLM Assessment (EVAL) is important to maintain the credibility and quality of your application. By integrating with Pytest and Vitest, developers familiar with these frameworks can now take advantage of Langsmith’s advanced features, such as observability and sharing capabilities, without compromising the developer experience.

Integrations allow developers to debug tests more effectively, log detailed metrics beyond simple pass/fail results, and easily share results across the team. The non-deterministic nature of LLM adds complexity to debugging. Langsmith stores and addresses the input, output, and stack trace of a test case.

Use built-in evaluation features

Langsmith offers the following built-in assessment features: expect.edit_distance()Calculate the string distance between test output and reference output. This feature is especially useful for developers whose applications need to continuously release the best version. Detailed insight into these features can be found in Langsmith’s API reference.

Start with Pytest and Vitest

Integration with Pytest requires the developer to add @pytest.mark.langsmith Decorator in test case. This setup records all test case results, application traces, and traces for Langsmith, giving you a comprehensive view of your application’s performance.

Similarly, Vitest users can create test cases ls.describe() Blocking to achieve the same level of integration and logging. Both frameworks provide real-time feedback and can be seamlessly integrated into continuous integration (CI) pipelines, helping developers catch regressions early.

Advantages over traditional evaluation methods

Traditional evaluation methods often require predefined datasets and evaluation functions, which can be limiting. Langsmith’s new integration provides flexibility by allowing developers to define specific test cases and evaluation logic tailored to the needs of their application. This approach is particularly advantageous for applications that need to be tested across multiple tools or models with different evaluation criteria.

The real-time feedback provided by these testing frameworks facilitates rapid iteration and local development, allowing developers to quickly improve their applications. Additionally, integration with the CI pipeline ensures that potential regressions are identified and resolved early in the development process.

For more information on how to leverage these integrations, you can refer to the how-to guides available on Langsmith’s comprehensive tutorials and documentation site.

Image source: Shutterstock

Langsmith enhances LLM assessment with Pytest and Vitest integration

Algorand (Algo) Get momentum in the launch and technical growth.

It flashes again in July

Stablecoin startups surpass 2021 venture capital peaks as institutional money spills.

MultiBank Group To List $MBG Token On Gate.io And MEXC During Official Token Generation Event

Earn $4,777 Daily! PaxMining Leads 2025’s Record-Breaking Bitcoin Mining Boom

GSR Leads $100M Private Placement Into Nasdaq-listed MEI Pharma To Launch First Institutional Litecoin Treasury Strategy Alongside Charlie Lee

KuCoin Launches XStocks, Delivering A One-Stop Access Point To Top Global Tokenized Equities

💵 FREE $18 USDT – Just For Signing Up!

How Does AIXA Mining Break Traditional Barriers?

How Does AIXA Mining Break Traditional Barriers?

The future of EF ecosystem development

Satoshi-Aera Bitcoin Whale moves another 40K BTC to Galaxy Digital.

Behind The Surge In XRP, DLMining Brings New Opportunities To Mine BTC With Your XRP

The strategy has hit the highest market cap since the Rally Bitcoin rally.

Top Insights

MultiBank Group To List $MBG Token On Gate.io And MEXC During Official Token Generation Event

Earn $4,777 Daily! PaxMining Leads 2025’s Record-Breaking Bitcoin Mining Boom

GSR Leads $100M Private Placement Into Nasdaq-listed MEI Pharma To Launch First Institutional Litecoin Treasury Strategy Alongside Charlie Lee

Most Popular

Investors in Option2Trade (O2T), Pyth Network (PYTH), and Injective (INJ) could win a whopping $888,000.

The Hong Kong Monetary Authority sets regulatory standards for tokenized products.

Climb the ranks with the new Futures Leaderboard

Langsmith enhances LLM assessment with Pytest and Vitest integration

Enhanced Testing Framework for LLM Assessment

Use built-in evaluation features

Start with Pytest and Vitest

Advantages over traditional evaluation methods

Related Posts