Class DatasetManager

Manager for dataset operations in Langfuse.

Provides methods to retrieve datasets and their items, with automatic pagination handling and convenient linking functionality for experiments.

Index

Constructors

constructor

Methods

get

Constructors

constructor

new DatasetManager(params: { langfuseClient: LangfuseClient }): DatasetManager
Internal
Creates a new DatasetManager instance.
Parameters
- params: { langfuseClient: LangfuseClient }
  Configuration object containing the API client
Returns DatasetManager
- Defined in client/src/dataset/index.ts:151

Methods

get

get(
name: string,
options?: { fetchItemsPageSize: number },
): Promise<FetchedDataset>

Retrieves a dataset by name with all its items and experiment functionality.

This method fetches a dataset and all its associated items, with support for automatic pagination to handle large datasets efficiently. The returned dataset object includes enhanced functionality for linking items to traces and running experiments directly on the dataset.

Parameters

name: string
The name of the dataset to retrieve
Optionaloptions: { fetchItemsPageSize: number }
Optional configuration for data fetching
- fetchItemsPageSize: number
  Number of items to fetch per page (default: 50)

Returns Promise<FetchedDataset>

Promise resolving to enhanced dataset with items, linking, and experiment capabilities

Example: Basic dataset retrieval

const dataset = await langfuse.dataset.get("my-evaluation-dataset");
console.log(`Dataset ${dataset.name} has ${dataset.items.length} items`);

// Access dataset properties
console.log(dataset.description);
console.log(dataset.metadata);

Example: Working with dataset items

const dataset = await langfuse.dataset.get("qa-dataset");

for (const item of dataset.items) {
  console.log("Question:", item.input);
  console.log("Expected Answer:", item.expectedOutput);

  // Each item has a link function for connecting to traces
  // await item.link(span, "experiment-name");
}

Example: Running experiments on datasets

const dataset = await langfuse.dataset.get("benchmark-dataset");

const result = await dataset.runExperiment({
  name: "GPT-4 Benchmark",
  runName: "GPT-4 Benchmark v1.2", // optional exact run name
  description: "Evaluating GPT-4 on our benchmark tasks",
  task: async ({ input }) => {
    const response = await openai.chat.completions.create({
      model: "gpt-4",
      messages: [{ role: "user", content: input }]
    });
    return response.choices[0].message.content;
  },
  evaluators: [
    async ({ output, expectedOutput }) => ({
      name: "exact_match",
      value: output === expectedOutput ? 1 : 0
    })
  ]
});

console.log(await result.format());

Example: Handling large datasets

// For very large datasets, use smaller page sizes
const largeDataset = await langfuse.dataset.get(
  "large-dataset",
  { fetchItemsPageSize: 100 }
);

Throws

If the dataset does not exist or cannot be accessed

See

FetchedDataset for the complete return type specification
RunExperimentOnDataset for experiment execution details

Since

4.0.0

Class DatasetManager

Index

Constructors

Methods

Constructors

constructor

Parameters

Returns DatasetManager

Methods

get

Parameters

fetchItemsPageSize: number

Returns Promise<FetchedDataset>

Example: Basic dataset retrieval

Example: Working with dataset items

Example: Running experiments on datasets

Example: Handling large datasets

Throws

See

Since

Settings

On This Page