“ـ” U+0640: Arabic Tatweel

U+0640: Technical Specifications for the Arabic Tatweel Character

The Arabic Tatweel (“ـ” U+0640) character, also known as Kashida, is a formatting character in the Arabic script. It elongates the connecting stroke between joined letters, either to justify a line or for aesthetic and typographic effect. This page gives developers and enthusiasts at iloveunicode.com the technical data, encoding conversions, and usage context for this codepoint.

U+0640 Character Data and Unicode Properties

The Unicode codepoint U+0640 represents the Arabic Tatweel. It is a spacing, join-causing character: it connects to the letters on both sides and lengthens the joining stroke between them, which is how it supports justification of Arabic text. It was officially added to Unicode in **version 1.1 (June, 1993)**. It belongs to the **Arabic** block of the **Basic Multilingual Plane** and, like the Arabic letters around it, is laid out right-to-left by bidirectional text rendering.

Main Properties

| Property | Value |
|---|---|
| Name | Arabic Tatweel |
| Unicode Codepoint | U+0640 |
| Unicode Version | 1.1 (June, 1993) |
| Block | Arabic |
| Plane | Basic Multilingual Plane |

Bidirectional and Other Data

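| Bidirectional Property | Value |
|---|---|
| Bidirectional class | Arabic Letter (AL) |
| Is mirrored? | No |

| Other Data Property | Value |
|---|---|
| Category | Modifier Letter (Lm) |
| Script | Common (ISO 15924 code Zyyy, "Code for undetermined script") |
| Combining Class | Not Reordered (0) |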
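
Most of the values above can be checked against the Unicode database that ships with a language runtime. The following is a minimal sketch using Python's standard `unicodedata` module; the names `TATWEEL` and `properties` are illustrative choices, not part of any API, and the Script property is left out because `unicodedata` does not expose it.

```python
import unicodedata

TATWEEL = "\u0640"  # illustrative constant name

# Look up the properties listed in the tables above.
properties = {
    "name": unicodedata.name(TATWEEL),
    "category": unicodedata.category(TATWEEL),
    "bidi_class": unicodedata.bidirectional(TATWEEL),
    "combining_class": unicodedata.combining(TATWEEL),
    "mirrored": bool(unicodedata.mirrored(TATWEEL)),
}

for key, value in properties.items():
    print(f"{key}: {value}")
```

The lookup reports the category as `Lm`, the bidirectional class as `AL`, and the combining class as `0`, matching the tables.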

Encoding Conversions for U+0640

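| Encoding Type | Value |
|---|---|
| HTML (decimal) | `&#1600;` |
| HTML (hex) | `&#x640;` |
| HTML (named) | none |
| URL Escape Code | `%D9%80` |
| CSS | `\00640` |
| JavaScript, JSON | `\u0640` |
| C, C++, Java | `\u0640` |
| Python | `\u0640` |
| Rust | `\u{0640}` |
| Ruby | `\u0640` |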
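
These conversions all follow mechanically from the code point value, so they can be regenerated rather than copied. The sketch below does this in Python with the standard library only; the dictionary keys and the five-digit CSS padding are illustrative choices made here, not a required format.

```python
from urllib.parse import quote

TATWEEL = "\u0640"
code_point = ord(TATWEEL)  # 0x640, which is 1600 in decimal

conversions = {
    # Numeric character references for HTML
    "html_decimal": f"&#{code_point};",
    "html_hex": f"&#x{code_point:X};",
    # Percent-encoding of the UTF-8 bytes
    "url_escape": quote(TATWEEL),
    # CSS escape: backslash followed by hex digits (padded to five here)
    "css": f"\\{code_point:05X}",
    # Backslash-u escape as used in Python source
    "python_escape": TATWEEL.encode("unicode_escape").decode("ascii"),
}

for label, value in conversions.items():
    print(f"{label}: {value}")
```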

UTF Byte Encodings (Hex)

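| Encoding | Byte Sequence (Hex) |
|---|---|
| UTF-8 | 0xD9 0x80 |
| UTF-16 | 0x0640 (one 16-bit code unit; bytes 0x06 0x40 in big-endian order) |
| UTF-32 | 0x00000640 (one 32-bit code unit; bytes 0x00 0x00 0x06 0x40 in big-endian order) |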
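
The byte sequences can be reproduced by encoding the character directly. This is a small sketch in Python; it uses the big-endian codecs (`utf-16-be`, `utf-32-be`) so that no byte order mark is added, and the name `byte_sequences` is an illustrative choice.

```python
TATWEEL = "\u0640"

# Encode with explicit big-endian codecs so no byte order mark is prepended.
byte_sequences = {
    "UTF-8": TATWEEL.encode("utf-8"),
    "UTF-16BE": TATWEEL.encode("utf-16-be"),
    "UTF-32BE": TATWEEL.encode("utf-32-be"),
}

for name, raw in byte_sequences.items():
    # bytes.hex() with a separator requires Python 3.8 or later
    print(name, raw.hex(" ").upper())
```

With the plain `utf-16` or `utf-32` codecs, Python prepends a byte order mark and uses the platform's native byte order, so the output would differ from the table.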

Code Snippet for Programming

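To represent the **Arabic Tatweel** character in a string literal in most common programming languages (JavaScript, Java, C, C++, Python, Ruby), use the following escape sequence. Rust uses the braced form `\u{0640}` instead.

\u0640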
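
As an illustration of the escape in use, the sketch below writes a word with and without Tatweel in Python and removes the elongation again before comparing. The helper `strip_tatweel` and the sample word are hypothetical choices for this example; whether Tatweel should be stripped at all depends on the application.

```python
TATWEEL = "\u0640"


def strip_tatweel(text: str) -> str:
    """Return text with every Tatweel (U+0640) removed (hypothetical helper)."""
    return text.replace(TATWEEL, "")


# The same four-letter word (meem, hah, meem, dal), written with escapes.
plain = "\u0645\u062d\u0645\u062f"
# The same word with two Tatweels inserted after the second letter.
stretched = "\u0645\u062d\u0640\u0640\u0645\u062f"

print(plain == stretched)                 # the raw strings differ
print(plain == strip_tatweel(stretched))  # equal once Tatweel is removed
```

Because Tatweel only changes how a word is drawn, two strings that differ only in U+0640 usually represent the same word, which is why a comparison like this can be useful for search or deduplication.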

Typing Instructions for U+0640

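  • Windows: In applications that support it, such as Microsoft Word, type 0 6 4 0 and then press Alt+X. With an Arabic keyboard layout, press Shift+J.
  • Mac: Enable the Unicode Hex Input keyboard layout, then hold Option ⌥ and type 0 6 4 0. Release Option.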

Preview of the Tatweel Character (ـ)

View how the **Arabic Tatweel** renders across different font families:

  • ـ (serif font example)
  • ـ (sans-serif font example)
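  • ـ (monospace font example)

Characters Based on Arabic Tatweel (U+0640)

The U+0640 character is the base for several presentation forms in the Arabic script. Those whose compatibility decomposition begins with U+0640 include:

  • U+FE71 Arabic Tatweel with Fathatan Above
  • U+FE77 Arabic Fatha Medial Form
  • U+FE79 Arabic Damma Medial Form
  • U+FE7B Arabic Kasra Medial Form
  • U+FE7D Arabic Shadda Medial Form
  • U+FE7F Arabic Sukun Medial Form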
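
The link between U+0640 and its presentation forms is recorded in the Unicode compatibility decompositions, so it can be queried programmatically. The following is a rough sketch in Python that scans the Arabic Presentation Forms-B block (U+FE70 to U+FEFF) for decompositions containing U+0640; the function name `based_on_tatweel` and the choice of range are assumptions made for this example.

```python
import unicodedata


def based_on_tatweel(start: int = 0xFE70, end: int = 0xFEFF):
    """List (code point, name) pairs whose decomposition contains U+0640."""
    matches = []
    for cp in range(start, end + 1):
        char = chr(cp)
        # decomposition() returns '' for characters without a mapping,
        # otherwise something like '<medial> 0640 064E'.
        fields = unicodedata.decomposition(char).split()
        if "0640" in fields:
            matches.append((f"U+{cp:04X}", unicodedata.name(char)))
    return matches


for code, name in based_on_tatweel():
    print(code, name)

# NFKC normalization expands a medial form back to Tatweel plus the mark.
expanded = unicodedata.normalize("NFKC", "\ufe77")
print([f"U+{ord(c):04X}" for c in expanded])
```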

Conclusion: Advancing Your Unicode Knowledge

Understanding specialized characters like the **Arabic Tatweel** (U+0640) is essential for accurate data handling and cross-platform compatibility, particularly in right-to-left scripts. The details of the U+0640 codepoint play a key role in typography, text rendering engines, and localization systems. With precise technical knowledge from iloveunicode.com, developers can achieve robust implementations and avoid encoding pitfalls for global audiences.
