IIIT Hyderabad Publications
SCIMAT: Dataset of Problems in Science and Mathematics

Authors: Snehith Kumar Chatakonda, Neeraj Kollepara, Pawan Kumar
Date: 2021-09-30
Report no: IIIT/TR/2021/121

Abstract

Datasets play an important role in driving innovation in algorithms and architectures for supervised deep learning tasks. Numerous datasets exist for images, language translation, and other tasks. One interesting challenge problem for deep learning is solving high school problems in mathematics and the sciences. To this end, a comprehensive collection of datasets containing hundreds of millions of samples, together with the modules that generate them, is required to propel research on these problems. This paper proposes a large set of datasets covering mathematics and science problems, along with the dataset generation code. Tests of a character-to-character transformer architecture on the proposed datasets show promising results, with test accuracy above 95%; for some datasets, however, test accuracy is below 30%. The dataset will be available at: www.github.com/misterpawan/scimat2

Full report: pdf
Centre for Security, Theory and Algorithms