Tuning Text Classification for Hereditary Diseases with Section Weighting
Supplementary information
Data sources
- Evaluation corpus (tar.gz, 9.5MB)
Contains 26 directories with up to 22 files each. 25 directories
represent hereditary diseases, and one contains 33 negative control
documents.
Please send any questions and requests to hakenberg(a)informatik.hu-berlin.de.