Re: [NTLK] extracting text

From: Robert Viragh (rviragh_at_yahoo.com)
Date: Fri Feb 25 2005 - 13:16:22 PST


well if you knew perl you'd just pipe a copy of the books through
perl -e 'while(<>){s/^.(..)*$/0$&/;($_=unpack([~#*],pack([#*],$_)))=~s'
-- yeah, yeah, that's a bit redundant, but at least it's
self-commenting.

Robert.
ps. obviously the unpack/pack solution leaves the pages creased, that's
why I say copies of the books - but it's superfast and cross-platform.
--- Owen Collins <collinso_at_wlu.edu> wrote:

> Is there a way that you can extract the text from a paperback book? I
> have a book I lost the source file for. It seems I can select the text
> on one page and copy it, then paste it into a note. But repeating it a
> hundred-something times is going to get old.
>
> Thanks all,
>
> O.
> --
> This is the NewtonTalk list - http://www.newtontalk.net/ for all
> inquiries
> Official Newton FAQ: http://www.chuma.org/newton/faq/
> WikiWikiNewt for all kinds of articles:
> http://tools.unna.org/wikiwikinewt/
>
>

                



This archive was generated by hypermail 2.1.5 : Fri Feb 25 2005 - 14:00:01 PST