CHAPTER 1: INFORMATION REPRESENTATION

1.1 DATA REPRESENTATION

1.1.1 Fundamental Characteristics of Number Systems

Every number system has two fundamental characteristics:

Base (Radix): The number of different digits that a system can use to represent numbers
Place Value: The specific value of a digit based on its position within a number

1.1.2 Denary (Decimal) System - Base 10

Uses digits 0-9
Each position represents powers of 10 (10⁰, 10¹, 10², etc.)
Example: 3,567 = (3 × 10³) + (5 × 10²) + (6 × 10¹) + (7 × 10⁰)

1.1.3 Binary System - Base 2

Key Points:

Uses only two digits: 0 and 1
Each bit (binary digit) represents a power of 2
All data and characters in computers are represented in binary

Binary Place Values:

<TEXT>

128 | 64 | 32 | 16 | 8 | 4 | 2 | 1
 2⁷ 2⁶ 2⁵ 2⁴ 2³ 2² 2¹ 2⁰

Example - Converting Denary to Binary:

Denary 65 in binary: 01000001
Calculation: 64 + 1 = 65

Example - Converting Binary to Denary:

Binary 01000001 = 64 + 1 = 65

1.1.4 Binary Prefixes vs Decimal Prefixes

It is crucial to understand the difference between binary prefixes (based on powers of 2) and decimal prefixes (based on powers of 10):

Denary Prefix	Factor	Value	Binary Prefix	Factor	Value
kilo- (k)	×10³	1,000	kibi- (Ki)	×2¹⁰	1,024
mega- (M)	×10⁶	1,000,000	mebi- (Mi)	×2²⁰	1,048,576
giga- (G)	×10⁹	1,000,000,000	gibi- (Gi)	×2³⁰	1,073,741,824
tera- (T)	×10¹²	1,000,000,000,000	tebi- (Ti)	×2⁴⁰	1,099,511,627,776

Important: Always use the correct prefix:

Computer storage uses binary prefixes (KiB, MiB, GiB, TiB)
Data transfer rates often use decimal prefixes (kbps, Mbps, Gbps)

1.1.5 Binary Coded Decimal (BCD)

Definition: Binary representation where each individual denary digit is represented by a sequence of 4 bits (nibble).

Characteristics:

Each nibble can represent denary digits 0-9
Uses only specific 4-bit patterns (0000 to 1001)
The patterns 1010 to 1111 are not used in BCD

Example - Converting 429 to BCD:

<TEXT>

4 = 0100
2 = 0010
9 = 1001
Therefore, 429 in BCD = 0100 0010 1001

Practical Applications:

Electronic devices displaying numbers (calculators)
Accurately measuring decimal fractions
Electronically coding denary numbers

1.1.6 Two's Complement Representation

Two's complement is used to represent negative numbers in binary.

Converting Negative Denary to Binary (Example: -42):

Step 1: Find binary equivalent (ignoring sign)

<TEXT>

42 = 00101010 (8-bit representation)

Step 2: Convert to one's complement (flip all bits)

<TEXT>

00101010 → 11010101

Step 3: Add 1 to get two's complement

<TEXT>

11010101 + 1 = 11010110

Converting Binary Two's Complement to Denary (Example: 11010110):

Step 1: Flip all bits

<TEXT>

11010110 → 00101001

Step 2: Add 1

<TEXT>

00101001 + 1 = 00101010

Step 3: Convert to denary and apply negative sign

<TEXT>

00101010 = 42
Therefore: -42

Range in 8-bit Two's Complement:

Maximum positive: +127 (01111111)
Maximum negative: -128 (10000000)

Overflow:

Occurs when the result of an arithmetic operation is too large/small to fit in the allocated bits
Example: Adding 127 + 1 in 8-bit gives -128 (overflow)

1.1.7 Hexadecimal System - Base 16

Characteristics:

Uses digits 0-9 and letters A-F
A=10, B=11, C=12, D=13, E=14, F=15

Converting Denary to Hexadecimal: Example: 165 to Hex

<TEXT>

165 ÷ 16 = 10 remainder 5
10 = A
Therefore: 165 = A5 (hex)

Converting Hexadecimal to Denary: Example: A5 to Denary

<TEXT>

A5 = (10 × 16) + (5 × 1) = 160 + 5 = 165

Practical Applications:

Defining colours in HTML (#FF0000 = red)
Defining MAC addresses
Assembly languages and machine code
Debugging via memory dumps

1.1.8 Character Sets and Encoding

Definition: A character set is a collection of characters that can be represented using binary codes. It typically includes upper and lower case letters, number digits, punctuation marks, and other characters.

Character Encoding Standards:

Standard	Description	Bits per Character	Characters
ASCII	American Standard Code for Information Interchange	7 bits	128
Extended ASCII	Extension of ASCII	8 bits	256
Unicode	Superset of ASCII and extended ASCII	16 or 32 bits	65,536+

ASCII:

Only supports English alphabet
7 bits = 128 possible characters
Includes control characters (0-31), printable characters (32-126)

Extended ASCII:

8 bits = 256 possible characters
Includes most European languages' alphabets
Still limited for global languages

Unicode:

Modern international standard
Supports all global languages
UTF-8 uses 1-4 bytes per character
Backward compatible with ASCII

1.2 MULTIMEDIA - GRAPHICS AND SOUND

1.2.1 Bitmap Images

Definition: Bitmap images are created by assigning a solid colour to each pixel using bit patterns. The image is represented as a grid of pixels, where each pixel's colour is encoded using binary values.

Key Terms:

Pixel: The smallest picture element whose colour can be accurately represented by binary code
File Header: Contains metadata including image size, number of colours, etc.

Image Resolution:

Definition: The number of pixels that make up an image
Example: 4096 × 3192 pixels
Effect: Higher resolution results in sharper, more detailed images

Screen Resolution:

Definition: The number of pixels that can be viewed horizontally and vertically on a device's screen
Example: 1680 × 1080 pixels

Colour Depth:

Definition: The number of bits used to represent the colour of a single pixel
Formula: If n bits are used, there are 2ⁿ colours per pixel
Example: 16-colour bitmap = 4 bits per pixel (2⁴ = 16)
Effect: Increasing colour depth improves colour quality but increases file size

File Size Calculation:

<TEXT>

File Size = Number of Pixels × Colour Depth

Example Calculation:

<TEXT>

Image: 1024 × 768 pixels, 24-bit colour
Number of Pixels = 1024 × 768 = 786,432
Colour Depth = 24 bits
File Size = 786,432 × 24 = 18,874,368 bits
 = 18,874,368 ÷ 8 = 2,359,296 bytes
 ≈ 2.36 MB

Applications:

Scanned images
Digital photographs
Computer screen displays
Small file sizes and easy manipulation when needed

1.2.2 Vector Graphics

Definition: Made up of drawing objects (mathematically defined constructs like rectangles, lines, circles, curves).

Components:

Drawing List: A set of commands defining the vector
Properties: Basic geometric data determining shape and appearance
Encoding: Data is encoded using mathematical formulas

Advantages over Bitmap:

Objects can be resized without losing quality
Scalability is the key benefit
Smaller file sizes for simple images
Can be enlarged infinitely without pixelation

Disadvantages:

Cannot represent complex images like photographs
More complex to create

Applications:

Company logos
Architectural drawings
Icons and symbols
Fonts (TrueType, PostScript)

1.2.3 Sound Representation

Analogue vs Digital:

Analogue	Digital
Continuous electrical signals	Discrete electrical signals
Infinite detail	Finite representation
Cannot be stored directly	Can be stored in binary

Sound as Analogue Data:

Sound consists of vibrations through a medium
Inherently analogue due to infinite detail variation

Conversion Process (Analogue to Digital):

Sampling: The sound wave's amplitude is measured at set time intervals
Quantization: Each sample is assigned a binary value
Encoding: Binary values are stored

Key Terms:

Sampling Rate: Number of samples taken per unit of time (measured in Hz)
- Effect: Increasing sampling rate improves accuracy but increases file size
- CD quality: 44,100 Hz
Sampling Resolution: Number of bits used to encode each sample
- Effect: Increasing resolution improves accuracy but increases file size
- CD quality: 16 bits
Bit Rate: Number of bits used to store 1 second of sound
- Formula: Bit Rate = Sampling Rate × Sampling Resolution
- Example: 44,100 × 16 = 705,600 bps (approximately 706 Kbps)

1.3 COMPRESSION

1.3.1 Need for Compression

Definition: Compression is the process of reducing file size without significant loss in quality.

Benefits:

Reduced storage requirements
Faster data transfer (uses less bandwidth)
Reduced time needed to search for data

1.3.2 Lossless Compression

Definition: A type of compression that allows original data to be perfectly reconstructed from the compressed file.

Key Feature:

Uses some form of replacement (substitution)
No data is permanently deleted

Examples:

PNG images (for graphics with sharp edges)
ZIP files
Text file compression
Database records
Run-Length Encoding (RLE)

Run-Length Encoding (RLE):

Definition: A form of lossless compression used for compressing text files and bitmap images.

Mechanism:

Reduces file size by encoding sequences of adjacent, identical elements
Encodes as two values: run count and run value

Example: Original: AAAAAAABBBBBCCCCCC Compressed: 7A5B6C

Example - Bitmap: Original row: White White White White White Black Black Compressed: 5W2B

Applications:

Simple graphics with large areas of same colour
Database records with repeated values

1.3.3 Lossy Compression

Definition: A type of compression that irreversibly eliminates unnecessary data.

Characteristics:

File accuracy/quality is lower than lossless
File size is significantly reduced (often to about 10% of lossless size)
Some original data is permanently lost

Examples:

MP3 (sound files)
JPEG (images)
MP4 (video files)

Mechanism in Sound Files (MP3):

Perceptual Coding: Removes parts of the sound that are less audible or discernible to human hearing
Removes frequencies outside human hearing range
Removes subtle volume differences

Mechanism in Images (JPEG):

Removes high-frequency details
Uses mathematical approximations
Reduces colour precision in less important areas

When to Use Lossy vs Lossless:

Lossless	Lossy
Text documents	Photography
Database files	Video streaming
Program files	Music (streaming)
Spreadsheets	Web graphics (where size matters)