We're digitizing Canada's Parliament.

Search the Canadian Hansard from 1901 to today.

Advanced Search...

Welcome to LiPaD:

The Linked Parliamentary Data Project

About the Canadian Hansard Dataset

The transcript of Parliamentary Debates (“Hansard”) is a 150-year running record of Canadian political history. The richness of this record presents a needle-in-a-haystack problem of enormous magnitude. At a rate of a novel’s worth of reading each day, it would take 66 years to read the transcripts of the House and the Senate. It would take a further 28 years to read what was added in the interim.

In 2013, a group of political scientists, computer scientists, and historians teamed up at the University of Toronto to solve this problem. With support from the SSHRC, the NSERC, the Digging into Data initiative, the Library of Parliament, Library and Archives Canada, Canadiana.org, and Michael Mulley at openparliament.ca, a key output of this collaboration is the first machine-readable and fully searchable historical Hansard. We have linked to these data to various biographical properties of parliamentarians, including their party and gender. This corpus underpins the search interface of this website.

We are continuously expanding this corpus and features on this website. We are adding the Debates of the Senate and the transcripts of parliamentary committees. We are also integrating more and more information about parliamentarians, including information about the demographic profiles and election outcomes in their constituencies. To request data or for information about how to get involved in this project, please contact us.

A full description of our project in a published article for citation purposes is forthcoming.

Qu'en est-il de la version française?

La numérisation de la version française du Hansard de la Chambre des communes est une priorité pour notre équipe. À l'heure actuelle, nous ne sommes malheureusement pas en mesure de rendre publique une telle version.
Lire la suite...

About This Site

This site was built using Python, Django, UIKit, PostgreSQL, and Haystack/Solr by twhyte.net.

It also makes use of the openparliament.ca API developed and maintained by Michael Mulley.

Complete source code is available at GitHub. A current list of dataset bugs is available on our Help page.

About us

We are a group of political scientists, computer scientists, and historians at the University of Toronto.

Kaspar Beelen is a postdoctoral fellow in the Departments of Political Science and Computer Science; Christopher Cochrane is an Associate Professor in the Department of Political Science; Graeme Hirst is a Professor in the Department of Computer Science; Nona Naderi is a Ph.D. student in the Department of Computer Science;Ludovic Rheault is an Assistant Professor in the Department of Political Science;and Tanya Whyte is a Ph.D. student in the Department of Political Science.

What about the French version?

Digitizing the French version of the Hansard is a priority for our research team. Unfortunately, we are not able to release it at this time.


This project was made possible through financial support from the Social Sciences and Humanities Research Council, the National Sciences and Engineering Research Council, and the Digging into Data initiative.

We are grateful to a number of colleagues who have provided helpful advice and feedback about this project. These include Jonathan Blaney (University of London); Richard Gartner (King’s College, London); Jaap Kamps (University of Amsterdam); Maarten Marx (University of Amsterdam); Paul Seaward (History of Parliament); Jane Winter (University of London). We have also benefited from the outstanding research assistance of Michael Kimms, Roman Polyanovsky, and Tim Alberdingk Thijm.

Although this project is much improved from the support of our funders and our colleagues, we assume full responsibility for the remaining errors.

Latest News

October 25, 2016 | Dataset Updates October 2016

The Lipad dataset has been updated to fix some errors and omissions discovered over the past few months. The following is a summary of important changes:

  • Approximately 80,000 speeches over the mid-1990s and early 2000s were not properly linked with their ParlINFO file and member information
  • Approximately 300 members across the dataset were missing select positions in their parliamentary career file
  • Downloading month-by-month .csv data exports has been replaced by a complete file of the Lipad dataset in a series of daily .csv files. This new package is available on the Data page of the site.

We suggest all users of an offline copy of the dataset re-download source files from the Data page.

Read More