
Conversation Summarisation with RL


Overview

This repository demonstrates a two-phase training pipeline for dialogue summarisation:

  1. Supervised Fine‑Tuning: T5-small and T5-base models are fine‑tuned on the SAMSum dialogue summarisation dataset.
  2. Reinforcement Learning (RL): The fine‑tuned checkpoints are further trained with Proximal Policy Optimization (PPO) on a custom synthetic dataset so that they generate non‑toxic, neutral summaries of toxic dialogues.

Features

  • Dialogue Summarisation: Leverages the SAMSum corpus for high‑quality conversational summaries.
  • Synthetic Toxic Dataset: Contains toxic dialogue inputs with neutral summaries to guide detoxification.
  • Reinforcement Learning: PPO-based training via Hugging Face’s TRL library to minimise toxicity.
  • Evaluation Suite: Automated ROUGE evaluation and toxicity checks using the facebook/roberta-hate-speech-dynabench-r4-target classifier.

Installation

  1. Clone the repository

    git clone https://github.com/dark-horiznz/Conversation-Summarisation-with-RL.git
    cd Conversation-Summarisation-with-RL
  2. Set up a virtual environment

    python3 -m venv venv
    source venv/bin/activate
  3. Install dependencies

    pip install -r requirements.txt

Data Preparation

  • SAMSum: Automatically loaded via:

    from datasets import load_dataset
    samsum = load_dataset("knkarthick/samsum")
  • Toxic Conversations: Provided in data/toxic_conversations.json, and also loadable from the Hugging Face Hub:

    from datasets import load_dataset
    toxic = load_dataset("majorSeaweed/toxic-dialogue-summarisation")
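Both datasets are plain text-to-text pairs. A quick sanity check on the SAMSum splits (the dialogue and summary field names are as documented on the dataset card; inspect the toxic dataset the same way, since its column names may differ):

    from datasets import load_dataset

    samsum = load_dataset("knkarthick/samsum")
    print(samsum)                          # train / validation / test splits
    example = samsum["train"][0]
    print(example["dialogue"][:200])       # raw multi-turn chat text
    print(example["summary"])              # human-written reference summary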

Supervised Fine‑Tuning

Fine‑tune on the SAMSum dataset using the Hugging Face Trainer.
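A minimal fine‑tuning sketch, assuming the standard Seq2SeqTrainer API and the SAMSum field names (dialogue, summary); hyperparameters and output paths are illustrative, not necessarily those used for the reported results.

    from datasets import load_dataset
    from transformers import (AutoTokenizer, AutoModelForSeq2SeqLM,
                              DataCollatorForSeq2Seq, Seq2SeqTrainer,
                              Seq2SeqTrainingArguments)

    model_name = "t5-small"                      # or "t5-base"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    samsum = load_dataset("knkarthick/samsum")

    def preprocess(batch):
        # T5 is a text-to-text model, so prefix the task name to each dialogue
        inputs = tokenizer(["summarize: " + d for d in batch["dialogue"]],
                           max_length=512, truncation=True)
        labels = tokenizer(text_target=batch["summary"],
                           max_length=128, truncation=True)
        inputs["labels"] = labels["input_ids"]
        return inputs

    tokenized = samsum.map(preprocess, batched=True,
                           remove_columns=samsum["train"].column_names)

    args = Seq2SeqTrainingArguments(
        output_dir="t5-small-samsum",            # illustrative output path
        learning_rate=3e-4,
        per_device_train_batch_size=8,
        num_train_epochs=3,
        predict_with_generate=True,
    )
    trainer = Seq2SeqTrainer(
        model=model,
        args=args,
        train_dataset=tokenized["train"],
        eval_dataset=tokenized["validation"],
        data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    )
    trainer.train()
    trainer.save_model("t5-small-samsum")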

Reinforcement Learning (PPO)

Apply PPO to detoxify summaries generated for the synthetic toxic dataset.

  • Reward Function: Combines a toxicity penalty (via a pretrained classifier) with fluency rewards; a minimal sketch follows.
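A minimal PPO sketch, assuming TRL's classic PPOTrainer interface (trl ≤ 0.11) and a dialogue column in the synthetic dataset; the reward below uses only the classifier's nothate probability, whereas the full reward also includes a fluency term.

    import torch
    from datasets import load_dataset
    from transformers import AutoTokenizer, pipeline
    from trl import PPOConfig, PPOTrainer, AutoModelForSeq2SeqLMWithValueHead

    ckpt = "t5-small-samsum"                     # supervised fine-tuned checkpoint
    tokenizer = AutoTokenizer.from_pretrained(ckpt)
    model = AutoModelForSeq2SeqLMWithValueHead.from_pretrained(ckpt)
    ref_model = AutoModelForSeq2SeqLMWithValueHead.from_pretrained(ckpt)

    toxicity = pipeline("text-classification",
                        model="facebook/roberta-hate-speech-dynabench-r4-target",
                        top_k=None)

    dataset = load_dataset("majorSeaweed/toxic-dialogue-summarisation", split="train")

    def tokenize(example):
        # assumes the synthetic dataset exposes a "dialogue" column
        example["input_ids"] = tokenizer("summarize: " + example["dialogue"],
                                         truncation=True, max_length=512).input_ids
        return example

    dataset = dataset.map(tokenize)
    dataset.set_format(type="torch", columns=["input_ids"])

    def collator(samples):
        return {key: [s[key] for s in samples] for key in samples[0]}

    ppo_trainer = PPOTrainer(PPOConfig(batch_size=8, mini_batch_size=2,
                                       learning_rate=1.41e-5),
                             model, ref_model, tokenizer,
                             dataset=dataset, data_collator=collator)

    gen_kwargs = {"max_new_tokens": 64, "do_sample": True, "top_k": 0, "top_p": 1.0}

    for batch in ppo_trainer.dataloader:
        queries = batch["input_ids"]
        responses = ppo_trainer.generate(queries, **gen_kwargs)
        summaries = tokenizer.batch_decode(responses, skip_special_tokens=True)

        # Reward: probability of the "nothate" class, so higher = less toxic
        rewards = [torch.tensor(next(s["score"] for s in scores
                                     if s["label"] == "nothate"))
                   for scores in toxicity(summaries)]

        ppo_trainer.step(queries, responses, rewards)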

Inference

Generate summaries with the trained PPO model.
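A minimal generation sketch, assuming the PPO-trained model was saved as a standard Transformers checkpoint (the path below is illustrative):

    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    ckpt = "t5-small-samsum-ppo"            # illustrative path to the PPO checkpoint
    tokenizer = AutoTokenizer.from_pretrained(ckpt)
    model = AutoModelForSeq2SeqLM.from_pretrained(ckpt)

    dialogue = ("Amanda: I baked cookies. Do you want some?\n"
                "Jerry: Sure!\n"
                "Amanda: I'll bring you some tomorrow :-)")

    inputs = tokenizer("summarize: " + dialogue, return_tensors="pt", truncation=True)
    summary_ids = model.generate(**inputs, max_new_tokens=64, num_beams=4)
    print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))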

Evaluation

  • ROUGE Metrics: Evaluate using built‑in evaluation scripts to compare against SAMSum references.
  • Toxicity Checks: Run a toxicity classifier on output summaries to verify detoxification.
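A sketch of both checks, assuming the evaluate library's rouge metric and the same hate-speech classifier named above; the predictions and references here are placeholders for model outputs and SAMSum test references.

    import evaluate
    from transformers import pipeline

    rouge = evaluate.load("rouge")
    toxicity_clf = pipeline("text-classification",
                            model="facebook/roberta-hate-speech-dynabench-r4-target",
                            top_k=None)

    predictions = ["Amanda will bring Jerry some cookies tomorrow."]   # model outputs
    references = ["Amanda baked cookies and will bring some to Jerry tomorrow."]

    # ROUGE-1/2/L F-scores against the reference summaries
    print(rouge.compute(predictions=predictions, references=references))

    # Probability of the "hate" class per generated summary (lower is better)
    for summary, scores in zip(predictions, toxicity_clf(predictions)):
        hate = next(s["score"] for s in scores if s["label"] == "hate")
        print(f"{hate:.3f}  {summary}")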

Results

| Model                 | ROUGE‑1 | ROUGE‑2 | ROUGE‑L |
|-----------------------|---------|---------|---------|
| t5-small (supervised) | 52.1    | 26.3    | 49.7    |
| t5-small (PPO)        | 53.4    | 27.1    | 51.2    |
| t5-base (supervised)  | 54.8    | 29.0    | 52.4    |
| t5-base (PPO)         | 56.2    | 30.5    | 53.9    |

SFT Model Comparison


PPO Model Comparison (Toxicity Scores by BERT)

1. T5 Base


2. T5 Small


Contributing

Contributions welcome! Please open an issue or pull request with enhancements or bug fixes.

License

This project is licensed under the MIT License. See the LICENSE file for details.
