Re: Can't pdftotext convert the the Arabic text?
- To: Development Discussions <developer at arabeyes dot org>
- Subject: Re: Can't pdftotext convert the the Arabic text?
- From: Nadim Shaikli <shaikli at yahoo dot com>
- Date: Sun, 31 Oct 2004 09:37:40 -0800 (PST)
--- Munzir Taha <munzirtaha at newhorizons dot com dot sa> wrote:
> Has anyone had better luck than me? Does anyone know what is missing?
> Is it a known issue or do I need to file a bug?
I'm not familiar with 'pdftotext' (you should have provided a link
to its homepage and/or authors - google didn't seem to turn up
anything tangible), yet I have a couple of generic suggestions for you:
1. Find an application that does lots of PDF to ... conversions
(to text, html, DOC, XML, etc) - the broader the better since
it will fill multiple needs in the future (if possible).
2. Contact the authors to see if Unicode/UTF-8 support is included
and/or forthcoming - "ask for it".
Once Unicode/UTF-8 support (#2) is in place, adding proper Arabic support
should be much simpler, and we should be able to do that ourselves given
enough interest.
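Before contacting anyone, it may help to check what a given converter
actually emits. The following is only a rough sketch, not tied to any
particular tool: it assumes the converter's output is available as raw
bytes, and the function name arabic_ratio is made up for illustration.
It verifies the bytes are valid UTF-8 and reports what fraction of the
letters are Arabic.

```python
import unicodedata


def arabic_ratio(data: bytes) -> float:
    """Return the fraction of letters in `data` that are Arabic.

    Hypothetical helper for illustration only. Raises UnicodeDecodeError
    if `data` is not valid UTF-8, which is itself a useful signal that
    the converter is not producing UTF-8 output.
    """
    text = data.decode("utf-8")

    # Only letters are counted; digits, spaces, punctuation and combining
    # diacritics are ignored.
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0

    # Unicode character names for Arabic letters and presentation forms
    # all contain the word "ARABIC".
    arabic = [ch for ch in letters if "ARABIC" in unicodedata.name(ch, "")]
    return len(arabic) / len(letters)
```

A ratio near zero on a PDF known to be Arabic (or a decode error) would
suggest the tool is dropping or mangling the text, which is worth
including in any report to its authors.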
In passing, look into the various UTF-8-capable search engines
(mnogosearch comes to mind) to see how they convert the contents of
PDFs in order to index them.
Salam.
- Nadim