[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Submitted papers

To: <general at arabeyes dot org>
Subject: Re: Submitted papers
From: Behdad Esfahbod <behdad at bamdad dot org>
Date: Mon, 20 Aug 2001 16:35:00 +0430 (IRST)

On Mon, 20 Aug 2001, Chahine M. Hamila wrote:

> David Starner wrote:
>
> There I am not an expert and I wasn't aware of any problem with
> UCS-2. But what I meant when writing that above is either UCS-2 or
> UCS-4 invariably. Both are better in terms of internal processing
> in a program since each character takes a constant space in
> memory.

but UCS-2 does not encode characters above BMP!

> UTF-8 is good for storage or data exchange, but it
> multiplies complexity of many basic string functions by n.

Which ones? any examples?

I use UTF-32 internally and UTF-8 for transfers too, but just convert
from UTF-8 to UTF-32 that I really need, example: in my implementation
of the filter that performs bidi algorithm on console, first I check
for any non-ascii character in UTF-8 input, and if I find one, then
convert it to UTF-32, apply bidi, and convert the output to UTF-8
again...

-- 
Behdad
29 Mordad 1380, 2001 Aug 20

[Finger for Geek Code]

Follow-Ups:
- Re: Submitted papers
  - From: Chahine M. Hamila

References:
- Re: Submitted papers
  - From: Chahine M. Hamila

Prev by Date: Re: Submitted papers
Next by Date: new arabeyes.org look
Previous by thread: Re: Submitted papers
Next by thread: Re: Submitted papers
Index(es):
- Date
- Thread