Skip to content

Project Description

doodyparizada edited this page Aug 8, 2011 · 4 revisions

Short DEscription of what were gonna do, in steps:

  1. receive RTE6
  2. parse structure and create an object/list
  3. this might be done effortlessly with json.loads
  4. enumerate on hypothesis and given sentences
  5. for each input hypothesis and candidate sentence.
  6. for each word in the sentence (or hypo, or both?) create a vector with the entailing words (hyponyms)
  7. using wordnet and another resource
  8. try to match each word in the hypo to a word in the sentence
  9. this should be done by POS, ie. a noun in the hypo will only be matched to a noun in the sentence
  10. the previous step creates a feature vector, which will be used to decide whether the sentence entails the hypo
  11. this step can be done by guessing (with manual limits)
  12. or, using a machine learning approche. this depends on Ido Dagan's answer...
  13. mark V or X for each sentence
  14. at the end grade ourselves against the given answers to determine recall and precision

Things to decide:

  1. language of choice
  2. python
  3. java
  4. who does what
  5. system design
Clone this wiki locally