The 'Datasets' folder contains the sample data for the manuscript 'Developing a Raman spectroscopy-based tool to stratify patient response to pre-operative radiotherapy in rectal cancer'. The data for each sample is in .csv format, and includes serperate files containing the raw data and the pre-processed data. Raw dataset files (_datasetraw) contain the sample name, position of the silicon calibration peak (used to correct the data), the wavenumber positions and multiple columns that contain each raw Raman spectra collected from the sample. The pre-processed dataset files (_datasetanalysed) contain the sample name, wavenumber positions and multiple columns containing the pre-processed Raw spectra collected from each sample. Details of the pre-processing steps can be found in the manuscript. The 'EMSC references' folder contains the raw dataset files and pre-processed files of the references used as inputs for the EMSC correction of the sample data.