Make patterns match multiple words
Reported by Will | January 5th, 2009 @ 01:39 PM | in 0.7
Patterns don't currently match multiple word sequences, but they
should.
They should also provide correct proportions in reports, e.g. if the pattern "A B" matches once in a document 10 words long then the proportion is 1/5 not 1/10
Comments and changes to this ticket
-
Will January 5th, 2009 @ 02:00 PM
- Milestone set to 0.7
-
Will January 5th, 2009 @ 02:18 PM
- State changed from new to open
-
Will January 9th, 2009 @ 11:42 AM
Patterns now match multiple words. The reporting has not yet been updated. Case-insensitive matching now works for non-ascii characters too (in RC4).
-
Will January 12th, 2009 @ 06:58 PM
Trying to figure out how to normalize reports with multi-word patterns has reminded me that I need to rethink the reporting structure.
I conclude that things should probably be expressed as rates rather than proportions and reported accordingly.
-
Will January 30th, 2009 @ 12:42 PM
On the one hand this is all now implemented (proportions, rates, or raw counts). On the other, it's not going to be ready for the 31st.
-
Bill Anderson-Samways June 4th, 2021 @ 07:43 PM
Hi Will - am using Yoshikoder for Mac but when I apply the dictionary it doesn't appear to be recording multiple-words terms (e.g. 'national interest', 'United States'). Could you please inform me how to use Yoshikoder to count instances of multiple-word terms?
Please Sign in or create a free account to add a new ticket.
With your very own profile, you can contribute to projects, track your activity, watch tickets, receive and update tickets through your email and much more.
Create your profile
Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป