CSE 1325: Digital Logic Design
Sidratul Tanzila Tasmi
Department of Computer Science and Engineering
United International University
1
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
011
100
101
110
2
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
89 Dec2Bin 011
100
0101 1001
101
110
3
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
Hi BCD 011
100
101
110
4
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$
5
Why do we need alphanumeric codes?
Addresses 8 bit locations
000
001
010
01001000
Hi BCD 011 0110 1001
100
101
110
6
American Standard Code for Information
Interchange
Printing characters (94):
26 uppercase letters
E.g. : A,C
26 lower case letters
E.g. : a,c
10 numbers
E.g. : 1,2
32 special characters
E.g. : @,$
7
American Standard Code for Information Interchange
There are three kinds of control characters:
1.Format effectors
HT- If the computer finds this character in the memory (in binary form)
It will provide a horizontal tab. Addresses 8 bit locations
000
001
010
0100 1000(H)
H i 0000 1001(HT)
011
0110 1001(i)
100
101
110
Fig: RAM
2/6/2023
8
8
American Standard Code for Information Interchange
There are three kinds of control characters:
2.Information effectors
FS – file separator. Helps in separating multiple files
1000 0100
0000 1100
1010 1101
FS (0001 1100)
1000 0100
0000 1100
1010 1101
9
American Standard Code for Information Interchange
There are three kinds of control characters:
3.Communication Control Characters
STX – Start of text Note: The corresponding
binary
ETX – End of text numbers are being sent
Wants to say
“Hi!”
ST
X
10
Problem with ASCII
Problem 1
• ASCII is well suited for English language
• It can represent a total of 256 characters
• But what about this case?
Problem 2
• You can develop your own coding scheme (mapping) for your own language
ASCII
BSCII (in your computer)
A= 0100 0001
ক= 0100 0001
11
Problem with ASCII (Cont.)
You type an amazing But your friend’s computer
poem in Bangla . Your uses ASCII.
computer encodes it in
He will get something
the RAM using BSCII.
gibberish.
For example ক will be
You send it to your friend misrepresented as A.
12
Unicode
Grapheme
A single unit of human writing system.
“J” , “ক”, “大”
Codepoint
Values assigned to the Graphemes (just like they did for ASCII)
“J” = 74 “ক”= 2453 “大”= 12233
13
UTF 8
“J” = 74 (100 1010)
“ক”= 2453 (1001 1001 1001)
“大”= 12233 (101111 11001001)
No. of bits Representation
7 0xxx xxxx (J=0100 1010)
11 110x xxxx 10xx xxxx (ক= ‘cant be rep’)
16 1110 xxxx 10xx xxxx 10xx xxxx ক= 1110 0000 10 100110 10 011001
21 11110xxx 10xx xxxx 10xx xxxx 10xx xxxx
14
UTF 32
UTF 32
• “J” = 74 (100 1010)
• “ক”= 2453 (1001 1001 1001)
• “大”= 12233 (101111 11001001)
UTF 32 does not use variable number of bits.
Each character(grapheme) is converted into codepoints as usual (see above)
Then 4 bytes are assigned for each character:
J= 00000000 00000000 00000000 01001010
15
More Information:
https://en.wikipedia.org/wiki/Unicode
16
THANK YOU!
Reference:
Digital Logic and Computer Design. M. Morris Mano (1979).
17