-
Notifications
You must be signed in to change notification settings - Fork 13
Spurious "'s" and "'" in sentential_subject_island.jsonl #8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I think it's on purpose. The intended possessive construction is like "[Janet's talking during the play] was annoying." or "We didn't count on [their taking three hours for dinner]." So the meaning of the first "good" sentence is, "Who was alarmed by the fact that children were investigating Joel?" and the second, "Who were being astounded by the fact that those governments were talking about Jason?" This category stands out in the paper as the one with the weakest support from the crowd-sourced survey: humans only picked the "right" answer 61% of the time. In any case, the possessive of "children" is certainly not "children'". I think ultimately it's very hard to build an instrument with 67,000 questions. Some of the templates worked better than others. |
Ok, so the first sentence has a typo, thanks. But even after correcting for that, I still can't get a valid parse out of So, on that view, the first sentence would be: "Who has [children's investigating Joel] alarmed" and the second: Do you see an alternative parse? |
Ah, no, I think you're correct, these examples are wrong. |
Great, we are in agreement! Would you like me to make a pull request with the corrections? I can't do right now cause I'm preparing for ACL, but sometime "soon" :) |
It appears that many (all?) entries in sentential_subject_island.jsonl have a spurious possessive as part of the subject. For instance:
{'UID': 'sentential_subject_island',
'field': 'syntax',
'lexically_identical': True,
'linguistics_term': 'island_effects',
'one_prefix_method': False,
'pairID': '23',
'sentence_bad': "Who have children' investigating alarmed Joel.",
'sentence_good': "Who have children' investigating Joel alarmed.",
'simple_LM_method': True,
'two_prefix_method': False}
{'UID': 'sentential_subject_island',
'field': 'syntax',
'lexically_identical': True,
'linguistics_term': 'island_effects',
'one_prefix_method': False,
'pairID': '32',
'sentence_bad': "Who were those governments' talking about astounding Jason.",
'sentence_good': "Who were those governments' talking about Jason astounding.",
'simple_LM_method': True,
'two_prefix_method': False}
The text was updated successfully, but these errors were encountered: