[OSM-dev] perl and special utf-8 characters
Gary68
gary at gary68.de
Sat Mar 19 08:52:10 GMT 2011
thank you, robert!!!
gerhard
On Sat, 2011-03-19 at 09:02 +0100, Robert Joop wrote:
> On 11-03-11 17:56:28 CET, Gary68 wrote:
> >
> > i wouldn't know how to do it with regexes.
> >
> > ok. let's put it this way. i have a POSITIVE list of allowed chars
> > (incl. 2-byte utf-8 ones) and i have a string.
> >
> > i want to eliminate all chars in the string that are not in the POSITIVE
> > list.
>
> sounds very simple.
>
> :r /tmp/g2
> use utf8;
>
> use Encode;
>
> my $s = 'abcäöüß$€✓XYZ';
> print "full: ", encode ('UTF-8', $s), "\n";
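> # /c complements the range, /d deletes: drop everything outside U+0000-U+00FF
> $s =~ tr/\000-\377//cd;
> print "latin1 only: ", encode ('UTF-8', $s), "\n";
> # same idea, now keeping only U+0000-U+007F
> $s =~ tr/\000-\177//cd;
> print "ASCII only: ", encode ('UTF-8', $s), "\n";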
> __END__
>
>
> :r !perl /tmp/g2
> full: abcäöüß$€✓XYZ
> latin1 only: abcäöüß$XYZ
> ASCII only: abc$XYZ
>
> rj
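
the tr ranges above cover the case where the allowed set is a contiguous block of code points. for an arbitrary positive list held in a variable, tr does not help directly, because its character lists are fixed at compile time and are not interpolated. a minimal sketch of one alternative, assuming the list is kept as a plain string: build a lookup hash and keep only the characters found in it. the sub name keep_allowed and the contents of the list are made up for illustration, not taken from the thread.

```perl
use strict;
use warnings;
use utf8;      # string literals in this file are UTF-8
use Encode;

# Hypothetical positive list: ASCII letters plus a few non-ASCII characters.
my $allowed_chars = join '', 'a' .. 'z', 'A' .. 'Z', 'äöüß€';

# One hash key per allowed character, for constant-time membership checks.
my %allowed = map { $_ => 1 } split //, $allowed_chars;

# Return a copy of the string with every character not in %allowed removed.
sub keep_allowed {
    my ($str) = @_;
    return join '', grep { $allowed{$_} } split //, $str;
}

my $s = 'abcäöüß$€✓XYZ';
print "allowed only: ", encode('UTF-8', keep_allowed($s)), "\n";
```

with this list, '$' and '✓' are dropped and the rest of the string is kept. the hash avoids having to escape characters such as ']' or '-' that are special inside a regex character class.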