8000 GitHub - yujinee/scimat2
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

yujinee/scimat2

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 

Repository files navigation

SCIMAT: Science and Mathematics Dataset

Authors: Snehith, Neeraj, and Pawan, IIIT, Hyderabad, copyright 2021

Paper: SCIMAT: Dataset of Problems in Science and Mathematics, arXiv: 2109.15005, 2021, Paper

Remark on using the dataset

-You may use this dataset with Char2Char or Word2Word encoding in Transformer or in LSTM as shown in the paper above.

-You may also generate different variations of the questions using the generator files.

Information:

  • This is an open source project, so feeel free to help us grow this dataset!
  • For any dataset, the generator files can be used to generate the required number of samples.
  • Some existing range of values for quantities in generator file may not make sense, feel free to adapt those.
  • some part of Mathematics dataset is adapted from deepmind dataset, for more Mathematics data see: https://github.com/deepmind/mathematics_dataset
  • Some part of the codes for mathematics datasets were written by Pratik Mandlecha (Microsoft, Hyderabad)

If you find this useful, then please do cite our work as follows:

Neeraj Kollepara, Snehith Kumar Chatakonda and Pawan Kumar, SCIMAT: Science and Mathematics Dataset, arXiv: 2109.15005, 2021.

If you prefer Bibtex, then use the following:

 @misc{scimat,
  
  title={SCIMAT: Science and Mathematics Dataset}, 
  
  author={Neeraj Kollepara and Snehith Kumar Chatakonda and Pawan Kumar},
  
  year={2021},
  
  eprint={2109.15005},
  
  archivePrefix={arXiv},
  
  primaryClass={math.NA}
  
  }

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%
0