Menu

Commit [r866]  Maximize  Restore  History

Fixing longstanding id-order bug.

We had to rewrite some old patternlib code that was just not doing
the right thing. The semantics of id-order now works like this:
* For each id-order, print all words matching the pattern `p` with this ID.
* Exception: if a child of `p` has ignore="true", all grandchildren are * skipped.
An ignore="true" in a parent of `p` (or `p` itself) causes no skipping.

Basic testing example:
./bin/grep.py -e 'N{id=a} N{id=b}' --id-order="b:a" --to=PlainCorpus test/inputs/ted500.xml | head

silvioricardoc 2015-08-26

changed /bin/grep.py
changed /bin/libs/base/patternlib.py
changed /bin/libs/filetype/ft_bracketpattern.py
changed /bin/libs/filetype/ft_xml.py
changed /bin/view.py
changed /test/filetype-samples/embeddings.GloVe
changed /test/filetype-samples/embeddings.word2vec
changed /test/patterns/reference-output/candidates-idword-french.xml
changed /test/patterns/testAll.sh
/bin/grep.py Diff Switch to side-by-side view
Loading...
/bin/libs/base/patternlib.py Diff Switch to side-by-side view
Loading...
/bin/libs/filetype/ft_bracketpattern.py Diff Switch to side-by-side view
Loading...
/bin/libs/filetype/ft_xml.py Diff Switch to side-by-side view
Loading...
/bin/view.py Diff Switch to side-by-side view
Loading...
/test/filetype-samples/embeddings.GloVe Diff Switch to side-by-side view
Loading...
/test/filetype-samples/embeddings.word2vec Diff Switch to side-by-side view
Loading...
/test/patterns/reference-output/candidates-idword-french.xml Diff Switch to side-by-side view
Loading...
/test/patterns/testAll.sh Diff Switch to side-by-side view
Loading...
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.