IIIT Hyderabad Publications |
|||||||||
|
SCIMAT: Dataset of Problems in Science and MathematicsAuthors: Snehith Kumar Chatakonda,Neeraj Kollepara,Pawan Kumar Conference: Big Data Analytics: 9th International Conference, BDA 2021 Pages: 1-16 Date: 2021-12-15 Report no: IIIT/TR/2021/114 AbstractDatasets play an important role in driving innovation in algorithms and architectures for supervised deep learning tasks. Numerous datasets exist for images, language translation, etc. One of the interesting challenge problems for deep learning is to solve high school problems in mathematics and sciences. To this end, a comprehensive set of dataset containing hundreds of millions of samples, and the generation modules is required that can propel research for these problems. In this paper, a large set of datasets covering mathematics and science problems is proposed, and the dataset generation codes are proposed. Test results on the proposed datasets for character-to-character transformer architecture show promising results with test accuracy above 95%, however,for some datasets it shows test accuracy of below 30%. Dataset will be available at: www.github.com/misterpawan/scimat2 Full paper: pdf Centre for Security, Theory and Algorithms |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |