Skip to content

Conversation

@paulinek13
Copy link
Contributor

@paulinek13 paulinek13 commented Feb 14, 2025

Description

This PR aims to close #416 by implementing a fetch function for Aya Red-teaming Dataset with filtering options for language, harm_category, and global_or_local parameters.

I've also added helper functions for handling .jsonl files and updated the test configuration to recognize .jsonl as a valid file type, since the Aya Red-teaming Dataset data is stored in this format.

Tests and Documentation

  • pytest tests/unit && pre-commit run --all-files completes without errors
  • fetch_aya_redteaming_dataset function added to the API reference
  • simple unit testing created for the introduced fetch function

paulinek13 and others added 7 commits February 14, 2025 10:04
This commit also adds helper functions to work with .jsonl files,
since the Aya Red-teaming dataset is stored in this format.
This update adds support for `.jsonl` (JSON Lines) files, ensuring that
unit tests pass successfully when these files are used.
Copy link
Contributor

@romanlutz romanlutz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This took me an embarrassingly long time to get to, but it's a fantastic contribution. Thank you @paulinek13

@romanlutz romanlutz merged commit 5bbbce7 into Azure:main Feb 25, 2025
14 checks passed
@paulinek13 paulinek13 deleted the feat/aya-redteaming-dataset branch February 27, 2025 14:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

FEAT Add Aya Red-teaming Dataset

2 participants