Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph
CODE
GitHub repository of the code: repo link
PAPER
Please download the paper here: paper link
AAAI 2018 SLIDES
Please download the slides here: slides link
BIBTEX
@article{1801.10314,
Author = {Amrita Saha and Vardaan Pahuja and Mitesh M. Khapra and Karthik Sankaranarayanan and Sarath Chandar},
Title = {Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph},
Year = {2018},
Eprint = {arXiv:1801.10314},
}
LICENSE
This dataset is released under a Creative Commons license.
DATASET
Please click here to download the dataset CSQA.
NEW: We have revised the dialogs after incorporating more feedback from users. (Dated March 29, 2018.)
NEW: Some JSON fields in the dialog zip have been slightly renamed. (Dated March 15, 2018.)
NEW: We have revised the dialog and Wikidata JSONs after incorporating feedback from several users. All users are requested to re-download the entire data, including both the Wikidata and dialog JSONs. (Dated March 6, 2018.)
Please click here to download the dataset CQA.
This contains the subset of QA pairs from the CSQA dataset whose questions are answerable without the previous dialog context (hence the name Complex Question Answering, i.e. CQA).
Please click here to download the dataset CQA_12K.
This is the same as the above dataset, except that it is a smaller version containing only 10K QA pairs for training and 1K each for the development and test sets. Each of the three splits is a subset of the corresponding train, development, or test split of the original CQA dataset.
Two-Fold Challenges of this Dataset
- Complexities in KB-based Question Answering
  - Some categories of questions cannot be answered from a single KB tuple and need inference over a much larger subgraph
  - Different kinds of inference are required: logical, quantitative, and comparative, as well as their combinations
  - Large scale of the Knowledge Base (consisting of millions of entities)
- Complexities in handling sequential Question Answering
  - Use conversation context to resolve co-references and ellipsis in utterances
  - Ask for clarifications for ambiguous queries
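
To make the first group of challenges concrete, here is a minimal sketch of why logical, quantitative, and comparative questions need more than a single KB tuple. It runs over a toy, in-memory triple store. Everything in it is an assumption made for illustration: the entity and relation names (`river_a`, `flows_through`, and so on) and the helper functions are hypothetical, and none of it reflects the dataset's actual JSON schema or the models described in the paper.

```python
from collections import defaultdict

# Toy (subject, relation, object) triples. All names are hypothetical
# placeholders, not entities or relations taken from the dataset.
TRIPLES = [
    ("river_a", "flows_through", "country_x"),
    ("river_a", "flows_through", "country_y"),
    ("river_b", "flows_through", "country_x"),
    ("river_c", "flows_through", "country_x"),
    ("river_c", "flows_through", "country_y"),
    ("river_c", "flows_through", "country_z"),
]


def build_indexes(triples):
    """Index triples in both directions so set operations are cheap."""
    by_object = defaultdict(set)   # (relation, object) -> subjects
    by_subject = defaultdict(set)  # (subject, relation) -> objects
    for subj, rel, obj in triples:
        by_object[(rel, obj)].add(subj)
        by_subject[(subj, rel)].add(obj)
    return by_object, by_subject


BY_OBJECT, BY_SUBJECT = build_indexes(TRIPLES)


def logical_and(relation, obj1, obj2):
    """Logical: which subjects are related to both obj1 AND obj2?"""
    first = BY_OBJECT.get((relation, obj1), set())
    second = BY_OBJECT.get((relation, obj2), set())
    return first & second


def count_objects(subject, relation):
    """Quantitative: how many objects is the subject related to?"""
    return len(BY_SUBJECT.get((subject, relation), set()))


def more_than(relation, reference_subject):
    """Comparative: which subjects have more objects than the reference?"""
    threshold = count_objects(reference_subject, relation)
    subjects = {s for (s, r) in BY_SUBJECT if r == relation}
    return {s for s in subjects if count_objects(s, relation) > threshold}


if __name__ == "__main__":
    print(logical_and("flows_through", "country_x", "country_y"))
    print(count_objects("river_c", "flows_through"))
    print(more_than("flows_through", "river_a"))
```

Each helper has to aggregate over several triples before it can answer, which is the sense in which these questions go beyond a single-tuple lookup. With millions of entities, such aggregation would have to run over a far larger subgraph than this toy index.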