Update
This commit is contained in:
parent
9c911cc430
commit
232c0266bf
21 changed files with 1928 additions and 1871 deletions
|
@ -79,4 +79,9 @@ similar (homoglyphs):
|
|||
ϹСⅭC𝙲ℂ𝐂𝕮𝒞𝗖
|
||||
```
|
||||
|
||||
**How to convert UTF-8 to ASCII?** [Easiest](kiss.md) way is to just filter out all bytes with the highest bit set, or, in other words, throw out all bytes with value higher than 127 (or maybe replace such bytes with question marks or something). This will possibly deform the text though, so it may be a last resort solution. Better (but of course still imperfect) results may be achieved by replacing Unicode characters by their ASCII [approximations](approximation.md) (e.g. the multiplaction symbol `×` by the letter `x` and so on), but this is non-trivial, a conversion table is needed -- thankfully there exist programs for doing this, e.g.: `cat unicodefile.txt | iconv -f utf-8 -t ascii//TRANSLIT`.
|
||||
**How to convert UTF-8 to ASCII?** [Easiest](kiss.md) way is to just filter out all bytes with the highest bit set, or, in other words, throw out all bytes with value higher than 127 (or maybe replace such bytes with question marks or something). This will possibly deform the text though, so it may be a last resort solution. Better (but of course still imperfect) results may be achieved by replacing Unicode characters by their ASCII [approximations](approximation.md) (e.g. the multiplaction symbol `×` by the letter `x` and so on), but this is non-trivial, a conversion table is needed -- thankfully there exist programs for doing this, e.g.: `cat unicodefile.txt | iconv -f utf-8 -t ascii//TRANSLIT`.
|
||||
|
||||
## See Also
|
||||
|
||||
- [ASCII](ascii.md)
|
||||
- [UTC](utc.md)
|
Loading…
Add table
Add a link
Reference in a new issue