Ministry of Justice: Data First
Data First is ana ambitiouspioneering data-linkingdata-linking, research and academic engagement programme led by the Ministry of Justice and funded by ADR UK.
Data First aimsunlocks to unlock the potential of the wealth of data already created by the Ministry of Justice (MOJ),(MOJ) by linkingmaking linked administrative datasets from across the justice system available for research. The programme is led by MOJ and enablingfunded accreditedby researchers,Administrative fromData withinResearch governmentUK (ADR UK), an investment by the Economic and academia,Social toResearch accessCouncil (ESRC).
Data from the datacourts, prison and probation services in anEngland ethicaland Wales have been linked to enable new and responsibleinnovative way.analysis of user journeys, interactions, and outcomes across the justice system. The projectprogramme willis also enhanceenhancing the linking of justice data with other government departments.departments, including education data from the Department for Education’s (DfE) National Pupil Database (NPD).
Data First enables researchers across government and academia to access these datasets in an ethical and responsible way via secure platforms in the ONS Secure Research Service and SAIL Databank.
By working in partnership with academic experts to facilitate and promote research in line with evidence priorities set out in the justiceMOJ space,Areas of Research Interest (ARI) Data First willis creategenerating anew sustainableinsights bodyto inform the development of knowledgegovernment onpolicy and drive real progress in improving justice systemoutcomes.
General users,programme theirinformation
The interactionsData First user guide provides further information about the programme, including the processes for accessing the data for research. The privacy and data protection statement provides information about how we use and share data.
- (PDF, 951 KB, 36 pages)
- (PDF, 199 KB, 12 pages)
Datasets
Data catalogues are available for all Data First datasets, providing information on the variables contained within each. These data catalogues are currently draft versions that provide basic details of each dataset and will be updated soon with final versions.
Data First has shared six datasets from administrative sources across the criminal,courts, prison and probation services in England and Wales: magistrates’ courts, the Crown Court, prisoner custodial journeys, probation services, and the family and civil courts.
The cross-justice system linking dataset can be used to join these six different datasets at a person level. This linking dataset also contains a table which can be used to join magistrates’ courts and theirCrown needs,Court pathwaysdata at a case level.
Separately, data on criminal histories from the Police National Computer (PNC) have been linked to education and outcomessocial acrosscare adata rangein England from the DfE NPD as part of publicthe services.MOJ-DfE data share. Please contact DataLinkingTeam@justice.gov.uk or data.sharing@education.gov.uk for the latest available metadata for the MOJ-DfE data share.
MOJ cross-justice system datasets
- (ODS, 815 KB)
- (ODS, 777 KB)
- (ODS, 172 KB)
- (ODS, 155 KB)
- (ODS, 97.4 KB)
- (ODS, 97.4 KB)
- (ODS, 59.1 KB)
Applying for data access
ThisData willFirst providedatasets greatercan insightbe accessed through the ONS Secure Research Service (SRS) or SAIL Databank (except for the MOJ-DfE data share, which is only available through the ONS SRS).
Requests to informaccess data through the developmentONS SRS require completion of MOJthe policiesSecure Access to Data Form here: Application form for secure access to data
- For access to Data First datasets (covering courts, prisoner custodial journey and
driveprobationrealdatasets),progresstheinapplicationimprovingformsocialshould be submitted to datafirst@justice.gov.uk - Applications for the MOJ-DfE dataset should be directed to DataLinkingTeam@justice.gov.uk and data.sharing@education.gov.uk.
- For all MOJ justice
outcomes.datasets (except the MOJ-DfE dataset) applicants will also need to complete the Research Project Application form, which will be assessed by the UK Statistics Authority Research Accreditation Panel (RAP).
Guidance for completing the application form can be found in the Data Sharing Guidance, and the list of datasets and access routes can be found here. Further information on the process overall is included within the Data First user guide above.
TheTo programmeaccess data within the SAIL Databank please apply though SAIL.
A register of external research projects which have been approved to use MOJ data is ledavailable to view here.
Analytical outputs
Statistical and social research publications using Data First data have been delivered by MOJ andanalysts fundedor in collaboration with other government departments. Outputs have also been produced by ADR UKUK-funded (AdministrativeResearch Fellows. These publications can be found below:
- Data
ResearchFirst:UK),anCriminalinvestmentCourtsbyLinkedtheData - Education,
Economicchildren’sSocialsocial care andResearchoffending - Criminal
Councilcourts(ESRC).research fellows - Family court – Cafcass research fellows
- Probation and criminal justice system research fellows
- MOJ-DfE research fellows
Splink: Data linkage at scale
Through Data First, the MOJ has developed a free and open-source software library to enable data linkage at scale. This software has been used to link some of the largest datasets held by MOJ as part of Data First.
Splink is now in its third version. It is a freely available, open-source Python package that is:
- faster and more accurate than other free tools
- able to link
hugelarge datasets, of tens of millionsorof records or more - developed with advice from academic experts in data linkage
- able to produce a wide range of interactive data visualisations that help to build effective models, explain linkage predictions, diagnose problems and quality assure models
- compatible with multiple databases and big data processing engines, meaning it can run on a wider range of computer systems
You can find out more on the the Splink website, where you can download and start using Splink. You can also also ask us a question or or raise an issue on on the public public GitHub repository. We’d. Splink beare very happy to hear from researchers interested in using Splinkthe software for their work.
GeneralAwards projectand informationRecognition
Datasets
Analytical outputs
DataDataData’-
Collaboration
Education,Award,children’sAnalysissocialincareGovernmentandAwardsoffending
Application2024, form
MOJ: Data First,First application form for secure access to data Team
Contact
Contact the Data First team at at datafirst@justice.gov.uk if if you would like further information or have any queries.
Last updated
-
General user information has been updated to reflect new datasets and linkages. Updates to the User Guide and data catalogues will follow. The order of sections of the document has changed. New contact information has been added.
-
Splink information added.
-
Data First Family Court data catalogue updated.
-
Data First prisoner custodial journey data catalogue updated.
-
Analytical outputs section added.
-
User guide updated and Data First probation data catalogue, Data First criminal courts, prisons and probation linking data catalogue published.
-
User guide updated and Data First Family Court data catalogue published.
-
User guide, privacy statement, Data First magistrates' court defendant data catalogue, Data First Crown Court defendant data catalogue and Data First criminal courts and prisons linking data catalogue updated.
-
User guide updated and Data First prisoner custodial journey data catalogue published.
-
User guide updated and Data First linked magistrates’ and Crown Court data catalogue published.
-
Documents updated and Data First Crown Court defendant data catalogue published.
Update history
2024-09-17 14:46
An explanatory note has been added to two variables in the prisoner custodial journey data catalogue. An additional minor change has been made to correct a typing error identified in the User Guide.
2024-07-30 14:04
Added ‘Data First research bulletin July 2024’ to analytical outputs section.
2024-07-22 16:18
Updated Family Court and Cross-Justice Linking data catalogues have been added, along with an updated User Guide
2024-06-28 14:26
An additional section, ‘Limitations of data linking’, has been added to the main text.
2024-06-10 14:54
Updated magistrates’ courts data catalogue added. Outdated criminal courts and prisons linking data catalogue removed.
2024-05-20 10:30
Updating data catalogues for magistrates’ courts, Crown Court, prisoner custodial journey and probation datasets.
2024-05-10 10:31
General user information has been updated to reflect new datasets and linkages. Updates to the User Guide and data catalogues will follow. The order of sections of the document has changed. New contact information has been added.
2022-10-14 13:08
Splink information added.
2022-08-30 15:42
Data First Family Court data catalogue updated.
2022-08-15 10:40
Data First prisoner custodial journey data catalogue updated.
2022-03-18 12:33
Analytical outputs section added.
2022-02-22 08:26
User guide updated and Data First probation data catalogue, Data First criminal courts, prisons and probation linking data catalogue published.