Zanimivo orodje [Fwd: Re: [lingucomponent-dev] Soem help, please]
Robert Ludvik
robert.ludvik at zd-lj.si
Tue Jan 20 18:41:51 CET 2004
kot zanimivost pa za arhiv... orodje za generiranje wordlist.
lp
This directory provides the follwoing scripts
* get-words - extract words from PO, PDF, MO, HTML and Lynx DAT files
* new-words - eliminate existing words from a wordlist
* sort-length - provide a length sorted wordlist
Jancs wrote:
> On Wed, 14 Jan 2004 08:36:01 +0200
> Dwayne Bailey <dwayne AT translate.org.za> wrote:
>
>
>>I've written some tools that take for instance pdf or HTML documents
>>and extract words from them using bash. I used them to extract words
>>from
>
>
> would be interesting to have a look at.
You can have a look at:
http://cvs.sourceforge.net/viewcvs.py/translate/src/wordlist
More information about the lugos-slo
mailing list