SlideShare a Scribd company logo
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
513
Pioneering VDT Image Compression using Block Coding
P.S.Jagadeesh Kumar
Associate Professor, Department of Compute Science
University of Cambridge, United Kingdom
Abstract: This paper patronage a unique block-built compression system for VDT images. The
images are alienated to blocks, which are pigeon-holed into four dissimilar type blocks namely
smooth blocks, text blocks, hybrid blocks and picture blocks with a dynamic block-built taxonomy
system. Four dissimilar coding systems are castoff for respective block type bestowing to their
disparate arithmetical chattels; to exploit their compression recital. Imitations demonstrate that the
suggested exertion has subordinate intricacy than DjVu with substantially enhanced chromatic
eminence at high bit rate, and it also outclasses the standard lossy image coding scheme JPEG.
Keywords: Block-built Compression, Block Taxonomy, Coding algorithms, VDT images.
1 Introduction
With the prevalent of digital diplomacies such as digital cameras, private PCs, ever
more VDT images, comprehending text, graphics and natural images, are prevailing in
digital systems such as screen images, web pages. The compassion of human eyes for
natural image and text is not the same. The quality prerequisite of VDT image coding
[1] is dissimilar from overall image coding since users cannot assent the quality if text is
not vibrant enough to identify. How to compress VDT image is a stiff delinquent. A
plenty of algorithms have been premeditated to compress images with dissimilar types.
The Lempel-Ziv algorithm [2] is deliberate in compressing pure text images, which only
have text on the pure color background in the entire image. The JPEG algorithm [3] is
decorous for pure picture images which do not have any text in the entire images, but
has debauched enactment on pure text images. Quite a few algorithms were also
suggested to compress VDT images. One type of maneuvers for VDT images is layered
coding. Utmost layered coding algorithms use the typical three layer mixed raster
content exemplification. One prevalent method is DjVu, which practices a wavelet-
based codec (IW44) for background and foreground, and JB2 for mask layer.
Conversely, the intricacy of mask generating and IW44 wavelet transform is great,
which makes DjVu not fit for real time application. Moreover, DjVu shows debauched
concert on pure text images [4,5,6].
Block-built methodologies for VDT images are also willful for their low intricacy.
Said et al. anticipated a simple blocked-based scheme, which compresses text blocks
using JPEG-LS, picture blocks using JPEG [7]. Conversely, it nosedives to knob the
hybrid blocks, which comprises mixed text and pictures. In text area, there are sturdy
edges which cannot finger commendably by DCT centered coding such as JPEG. In this
paper, extant an unusual system, which can progressively compress images with diverse
content types, such as pure text images, pure picture images, and VDT images. Four
coding algorithms are premeditated for those blocks with disparate types based on a
dissolute block taxonomy method.
2 Block Coding System
2.1 Block-built compression
The VDT image is foremost divided into 16x16 blocks. The 16x16 block
dissection is done in-order to upturn the exactitude of segmentation as well as to
expedite superior taxonomy of blocks. Further 32x32 blocks may upsurge the intricacy
of block taxonomy. At that time, blocks are categorized into four types: smooth, text,
hybrid and picture according to their different arithmetic physiognomies. Blocks of
diverse type will be compressed with altered algorithms. The block type map is
compressed using an arithmetic coder.
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
514
2.2 Block taxonomy Algorithm
The block taxonomy algorithm is habitually a reckoning arduous job. Here
proposed a nippy and operative taxonomy algorithm based on two skins: histogram and
ascent of the block. The pixels of per capita block are primarily congregated into three
classes: low-ascent pixels, mid-ascent pixels and high-ascent pixels according to pixel’s
ascent value. Then the histogram dispersal of per capita pixel group is figured. The
emblematic ascent-histogram dissemination is shown in Fig.1. As witnessed in Fig.1,
blocks of dissimilar type spectacles unlike ascent-histogram distributions. At that
moment categorized the blocks into four types: smooth, text, hybrid and picture blocks
based on ascent-histogram dissemination. The smooth blocks archetypally comprehend
low-ascent pixels and show one peak at the low-ascent histogram. The text blocks
constantly show several peaks at the low-ascent and high-ascent histograms. Merely a
little mid-ascent pixels can be observed in text blocks. If the block comprehends large
numbers of high-ascent pixels and mid-ascent pixels, it will be recognized as hybrid
block. The blocks mostly consisting of mid-ascent pixels are blatant as picture blocks.
Here T1~T7 are thresholds for arbitrating the block types. The low, mid and high ascent
pixel values as well as threshold values are kept undisclosed for intellect reasons due to
their inconsistent nature and are prevailed in range for ease.
Fig. 1. The three histograms of text block (left), picture block (mid) and hybrid block (right)
The blocks of dissimilar type are discrete in general and have dissimilar
information disseminations. Smooth blocks are plane and subjugated by one benevolent
of color. Text blocks are denser in spatial domain than in DCT domain. The influence of
picture blocks is mainly strenuous on low frequency coefficients when they are DCT
transmuted. Hybrid blocks, covering mixed text and picture images, cannot be
efficiently epitomized both in spatial and frequency domain. Four coding algorithms are
prudently intended to compress blocks of dissimilar types effectually.
2.3 Smooth Block Coding
The coding of smooth blocks is candid. Smooth blocks are controlled by one color
and their gray level range is restricted to the assumed threshold. Every colors in smooth
blocks are quantized to the utmost recurrent color, which is veiled by an arithmetic
coder.
2.4 Text Block Coding
The text blocks are archetypally subjugated by numerous chief colors. The colors
with incidence above the assumed threshold are preferred as chief colors. If close by
over and above four colors sustaining beyond obligation, solitary the first four colors
with prime number in luminance histogram will be preferred as chief colors. The colors
adjacent to chief colors inside the assumed threshold are quantized to their conforming
chief colors. The color quantization algorithm can be described as shown in Fig.2.
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
515
Fig. 2. Color Quantization Algorithm
Each pixel’s color in the text block is initially transformed to color index. The chief
colors are indexed by 0, 1, 2 and 3 correspondingly. All the other colors are rehabilitated
to index 4. The chief colors in each block are chronicled.
Fig. 3. Text block coding contexts
The text block index is skimmed and compressed in a raster scanning order and the
present pixel index is coded based on its causal neighbors as shown in Fig.3 to endeavor the
spatial significance in order to advance coding efficacy. “X” is present pixel to be implied.
Each neighbor may be five different index values; there are total 5 125 3 contexts for coding
codin the present pixel X. The present pixel index is coded by an arithmetic coder using the
context specified by its three casual neighbors [NW, N, and W]. If the present pixel index is
4, the pixel value is also coded using an arithmetic coder.
Fig. 4. Text Block Coding
2.5 Hybrid Block Coding
Hybrid blocks comprise variegated text and pictures. There are robust high frequency
signals owing to text edges. DCT transform is only operative to condense the vigor of
low frequency signals, so the vigor in DCT domain of hybrid block is very dissimilar
and inflexible to code. When Hybrid blocks are compressed with DCT block transform
based coding such as JPEG, it will agonize from ringing effects about the text due to
huge quantization step for those high frequency constituents. Wavelet-based systems
such as JPEG-2K nosedive to compress hybrid blocks effectually.
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
516
Whereas hybrid blocks are compressed with document image algorithms, the coding
recital is too squat to be satisfactory. One elucidation to this delinquent is layered coding
such as DjVu. The text and pictures are detached into dissimilar layers and self-reliantly
coded. Proposed a haar wavelet based coding algorithm for hybrid blocks. As we know,
short wavelet bases are obliging to moderate the ringing effect around text (edge), and
longer bases are virtuous to advance the coding recital of the picture images. As a quid
pro quo between two chucks, haar wavelet is designated to code hybrid blocks. Though
haar wavelet’s poor recital on pure picture images, it can meritoriously confiscate the
ringing effect on text images. Its coding recital outclasses other coding algorithms both
in PSNR and visual quality. The hybrid block is first malformed with haar wavelet. Now
that only one level haar wavelet transform is castoff, subsequently multilevel haar
wavelet transform will produce long wavelet bases not appropriate for hybrid blocks.
The wavelet coefficients are then coded by a simple arithmetic coder. The coefficients of
dissimilar sub bands are veiled using unlike contexts. The simple haar wavelet algorithm
can suggestively advance the visual quality and PSNR of images with hybrid blocks.
The visual quality of rebuilt images of three methods is equated in Fig.5: coded by one
level haar wavelet, three levels haar wavelet and DCT. The PSNR-bitrate curves of the
three methods are shown in Fig.6. It is palpable that one level haar wavelet attains the
finest coding recital for hybrids blocks.
Fig. 5. Visual quality comparisons of reconstructed images
Fig. 6. PSNR-bitrate curves for hybrid blocks of three methods: haar one level,
haar three level and DCT.
2.6 Picture Block Coding
JPEG has been evidenced to be an effective low complexity algorithm to compress
picture images. Here proposed a JPEG-like algorithm to compress the picture blocks.
The dissimilarity is that it is needed to gambol the blocks of other types. As expected,
the system achieves analogous coding recital to jpeg on pure photographic images with
trivial overhead of blocks type map.
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
517
3 Experiment results
Five images are verified in our tryouts: VDT1, VDT2, VDT3, picture1 and text1,
which are revealed in Fig.7. VDT1~VDT3 are instances of typical VDT images with
mixed text and pictures; Text1 is a document image; Picture1 is the sample of pure
picture images. The coding efficacy between the proposed system and JPEG are equated
in Fig.8 for three of the VDT images. The proposed system undoubtedly outclasses jpeg
for all the images. In the pre-eminent circumstance, the proposed system outperforms
JPEG up to 7db for VDT2, which comprises a ration of hybrid blocks. Fig.9 relates the
visual quality of rebuilt images coded by JPEG, DjVu, and the proposed system at the
similar bitrate. It is apparent that the proposed system accomplishes much improved
visual quality than JPEG and also outclasses DjVu. The ringing affects round text in text
blocks and hybrid blocks are efficaciously detached by the proposed system.
Fig. 7. Test images: (a) VDT1, (b) VDT2, (c) VDT3, (d) Picuture1 and (e) Text1
Fig. 8. PSNR-bitrate curves for VDT1, VDT2 and
VDT3 of BB vs. JPEG
IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518
518
Fig. 9. Visual quality comparisons of portion of reconstructed images:
Top (VDT2) and Bottom (VDT3).
The proposed system is of low complexity since the algorithms castoff has less time
complexity. The taxonomy algorithm just customs the histogram of a block; the hybrid
coder practices haar wavelet; the text coder practices context-based arithmetic coding;
the most reckoning lavish part of scheme is picture coding, which use a JPEG-like
algorithm. In the vilest, the proposed system has analogous intricacy with JPEG.
4 Conclusion
In this exertion, a Block-built compression system for VDT image coding is acquiesced.
A profligate block taxonomy algorithm is suggested and four compression algorithms are
prudently deliberated for blocks of diverse kinds. The proposed system accomplishes
analogous or improved coding recital, on pure text images and pure picture images, to LZW
and JPEG respectively. The system outclasses DjVu on VDT images at high bitrate.
References
[1] “Coding of still images”, ITU-T, ISO/IEC SG 8 FCD 15444-6, 2004.
[2] J.Ziv and A. Lempel, “A universal algorithm for data compression”, IEEE
Transaction on Information Theory, IT-23(3), pp.337-343, May 2003.
[3] W. P. Pennebaker, J. L. Mitchell, “JPEG: Still image compression standard”, Van
Nostrand Reinhold, 2004.
[4] “Mixed Raster Content (MRC)”, ITU-T Recommendation T.44, Study Group-8
Contributions, 2001.
[5] P. Haffner, L. Bottou, P.G. Howard, P. Simard, Y. Bengio, Y. LeCun, “High Quality
document image compression with DjVu”, Journal of Electronic Imaging, pp.410-
425, July, 1998.
[6] B.-f Wu, C.-C Chiu and Y. –L Chen, “Algorithms for compressing compound
document images with large text/background overlap”, IEEE Proceedings on
visual image signal process, Vol. 151 No. 6, pp.453-459 December 1997.
[7] A.Said and A.Drukarev, “Simplified segmentation for compound image
compression”, Proceeding of ICIP’ 2004, pp.229-233.
BIOGRAPHY
P.S.Jagadesh Kumar is working as Associate Professor in the Department of
Computer Science, University of Cambridge, United Kingdom. He received his B.E
degree from University of Madras in EEE discipline by the year 1999. He obtained
his M.E degree in 2004 with specialization in CSE from Annamalai University,
Chidambaram, India.

More Related Content

What's hot (20)

PDF
Automated Neural Image Caption Generator for Visually Impaired People
Christopher Mehdi Elamri
 
PDF
EMPIRICAL STUDY OF ALGORITHMS AND TECHNIQUES IN VIDEO STEGANOGRAPHY
Journal For Research
 
PDF
Self-Directing Text Detection and Removal from Images with Smoothing
Priyanka Wagh
 
PDF
A Survey of Deep Learning Algorithms for Malware Detection
IJCSIS Research Publications
 
PDF
Ijarcet vol-2-issue-2-855-860
Editor IJARCET
 
PDF
3ways to improve semantic segmentation
Frozen Paradise
 
PDF
Inpainting scheme for text in video a survey
eSAT Journals
 
PDF
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
CSCJournals
 
PPTX
Texture features based text extraction from images using DWT and K-means clus...
Divya Gera
 
PDF
Image compression using negative format
eSAT Publishing House
 
PDF
Image compression using negative format
eSAT Journals
 
PDF
Comparison Between Levenberg-Marquardt And Scaled Conjugate Gradient Training...
CSCJournals
 
PPTX
Three classes of deep learning networks
Venkat Chaithanya Chintha
 
PDF
A1804010105
IOSR Journals
 
PDF
Ijetcas14 527
Iasir Journals
 
PDF
Selective image encryption using
csandit
 
PDF
Analysis of image storage and retrieval in graded memory
eSAT Journals
 
PPTX
Text extraction from natural scene image, a survey
SOYEON KIM
 
PDF
Image Encryption Based on Pixel Permutation and Text Based Pixel Substitution
ijsrd.com
 
PDF
37 c 551 - reduced changes in the carrier of steganography algorithm
Mohammed Kharma
 
Automated Neural Image Caption Generator for Visually Impaired People
Christopher Mehdi Elamri
 
EMPIRICAL STUDY OF ALGORITHMS AND TECHNIQUES IN VIDEO STEGANOGRAPHY
Journal For Research
 
Self-Directing Text Detection and Removal from Images with Smoothing
Priyanka Wagh
 
A Survey of Deep Learning Algorithms for Malware Detection
IJCSIS Research Publications
 
Ijarcet vol-2-issue-2-855-860
Editor IJARCET
 
3ways to improve semantic segmentation
Frozen Paradise
 
Inpainting scheme for text in video a survey
eSAT Journals
 
Shallow vs. Deep Image Representations: A Comparative Study with Enhancements...
CSCJournals
 
Texture features based text extraction from images using DWT and K-means clus...
Divya Gera
 
Image compression using negative format
eSAT Publishing House
 
Image compression using negative format
eSAT Journals
 
Comparison Between Levenberg-Marquardt And Scaled Conjugate Gradient Training...
CSCJournals
 
Three classes of deep learning networks
Venkat Chaithanya Chintha
 
A1804010105
IOSR Journals
 
Ijetcas14 527
Iasir Journals
 
Selective image encryption using
csandit
 
Analysis of image storage and retrieval in graded memory
eSAT Journals
 
Text extraction from natural scene image, a survey
SOYEON KIM
 
Image Encryption Based on Pixel Permutation and Text Based Pixel Substitution
ijsrd.com
 
37 c 551 - reduced changes in the carrier of steganography algorithm
Mohammed Kharma
 

Similar to Pioneering VDT Image Compression using Block Coding (20)

PDF
Block Classification Scheme of Compound Images: A Hybrid Extension
DR.P.S.JAGADEESH KUMAR
 
PDF
Improved block based segmentation for jpeg compressed document images
eSAT Journals
 
PDF
Image compression using embedded zero tree wavelet
sipij
 
PDF
Improved block based segmentation for jpeg
eSAT Publishing House
 
PDF
D017542937
IOSR Journals
 
PDF
Modified Skip Line Encoding for Binary Image Compression
idescitation
 
PDF
Image compression by ezw combining huffman and arithmetic encoder
IAEME Publication
 
PDF
Intelligent Parallel Processing and Compound Image Compression
DR.P.S.JAGADEESH KUMAR
 
PPT
notes_Image Compression.ppt
HarisMasood20
 
PPT
notes_Image Compression.ppt
HarisMasood20
 
PDF
Stegnography of high embedding efficiency by using an extended matrix encodin...
eSAT Publishing House
 
PDF
Stegnography of high embedding efficiency by using an extended matrix encodin...
eSAT Journals
 
PDF
PIXEL SIZE REDUCTION LOSS-LESS IMAGE COMPRESSION ALGORITHM
ijcsit
 
PDF
Non standard size image compression with reversible embedded wavelets
eSAT Journals
 
PDF
Non standard size image compression with reversible embedded wavelets
eSAT Publishing House
 
PDF
11.secure compressed image transmission using self organizing feature maps
Alexander Decker
 
PDF
I07015261
IJERD Editor
 
PDF
11.compression technique using dct fractal compression
Alexander Decker
 
PDF
Compression technique using dct fractal compression
Alexander Decker
 
Block Classification Scheme of Compound Images: A Hybrid Extension
DR.P.S.JAGADEESH KUMAR
 
Improved block based segmentation for jpeg compressed document images
eSAT Journals
 
Image compression using embedded zero tree wavelet
sipij
 
Improved block based segmentation for jpeg
eSAT Publishing House
 
D017542937
IOSR Journals
 
Modified Skip Line Encoding for Binary Image Compression
idescitation
 
Image compression by ezw combining huffman and arithmetic encoder
IAEME Publication
 
Intelligent Parallel Processing and Compound Image Compression
DR.P.S.JAGADEESH KUMAR
 
notes_Image Compression.ppt
HarisMasood20
 
notes_Image Compression.ppt
HarisMasood20
 
Stegnography of high embedding efficiency by using an extended matrix encodin...
eSAT Publishing House
 
Stegnography of high embedding efficiency by using an extended matrix encodin...
eSAT Journals
 
PIXEL SIZE REDUCTION LOSS-LESS IMAGE COMPRESSION ALGORITHM
ijcsit
 
Non standard size image compression with reversible embedded wavelets
eSAT Journals
 
Non standard size image compression with reversible embedded wavelets
eSAT Publishing House
 
11.secure compressed image transmission using self organizing feature maps
Alexander Decker
 
I07015261
IJERD Editor
 
11.compression technique using dct fractal compression
Alexander Decker
 
Compression technique using dct fractal compression
Alexander Decker
 
Ad

More from DR.P.S.JAGADEESH KUMAR (20)

PDF
Panchromatic and Multispectral Remote Sensing Image Fusion Using Particle Swa...
DR.P.S.JAGADEESH KUMAR
 
PDF
Bi-directional Recurrent Neural Networks in Classifying Dementia, Alzheimer’s...
DR.P.S.JAGADEESH KUMAR
 
PDF
Promise and Risks Tangled in Hybrid Wavelet Medical Image Fusion Using Firefl...
DR.P.S.JAGADEESH KUMAR
 
PDF
Optical Picbots as a Medicament for Leukemia
DR.P.S.JAGADEESH KUMAR
 
PDF
Integrating Medical Robots for Brain Surgical Applications
DR.P.S.JAGADEESH KUMAR
 
PDF
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
DR.P.S.JAGADEESH KUMAR
 
PDF
Continuous and Discrete Crooklet Transform
DR.P.S.JAGADEESH KUMAR
 
PDF
A Theoretical Perception of Gravity from the Quantum to the Relativity
DR.P.S.JAGADEESH KUMAR
 
PDF
Advanced Robot Vision for Medical Surgical Applications
DR.P.S.JAGADEESH KUMAR
 
PDF
Pragmatic Realities on Brain Imaging Techniques and Image Fusion for Alzheime...
DR.P.S.JAGADEESH KUMAR
 
PDF
Intelligent Detection of Glaucoma Using Ballistic Optical Imaging
DR.P.S.JAGADEESH KUMAR
 
PDF
Robotic Simulation of Human Brain Using Convolutional Deep Belief Networks
DR.P.S.JAGADEESH KUMAR
 
PDF
Panchromatic and Multispectral Remote Sensing Image Fusion Using Machine Lear...
DR.P.S.JAGADEESH KUMAR
 
PDF
Classification and Evaluation of Macular Edema, Glaucoma and Alzheimer’s Dise...
DR.P.S.JAGADEESH KUMAR
 
PDF
Multilayer Perceptron Neural Network Based Immersive VR System for Cognitive ...
DR.P.S.JAGADEESH KUMAR
 
PDF
Computer Aided Therapeutic of Alzheimer’s Disease Eulogizing Pattern Classifi...
DR.P.S.JAGADEESH KUMAR
 
PDF
Congenital Bucolic and Farming Region Taxonomy Using Neural Networks for Remo...
DR.P.S.JAGADEESH KUMAR
 
PDF
Machine Learning based Retinal Therapeutic for Glaucoma
DR.P.S.JAGADEESH KUMAR
 
PDF
Fingerprint detection and face recognition for colonization control of fronti...
DR.P.S.JAGADEESH KUMAR
 
PDF
New Malicious Attacks on Mobile Banking Applications
DR.P.S.JAGADEESH KUMAR
 
Panchromatic and Multispectral Remote Sensing Image Fusion Using Particle Swa...
DR.P.S.JAGADEESH KUMAR
 
Bi-directional Recurrent Neural Networks in Classifying Dementia, Alzheimer’s...
DR.P.S.JAGADEESH KUMAR
 
Promise and Risks Tangled in Hybrid Wavelet Medical Image Fusion Using Firefl...
DR.P.S.JAGADEESH KUMAR
 
Optical Picbots as a Medicament for Leukemia
DR.P.S.JAGADEESH KUMAR
 
Integrating Medical Robots for Brain Surgical Applications
DR.P.S.JAGADEESH KUMAR
 
Automatic Speech Recognition and Machine Learning for Robotic Arm in Surgery
DR.P.S.JAGADEESH KUMAR
 
Continuous and Discrete Crooklet Transform
DR.P.S.JAGADEESH KUMAR
 
A Theoretical Perception of Gravity from the Quantum to the Relativity
DR.P.S.JAGADEESH KUMAR
 
Advanced Robot Vision for Medical Surgical Applications
DR.P.S.JAGADEESH KUMAR
 
Pragmatic Realities on Brain Imaging Techniques and Image Fusion for Alzheime...
DR.P.S.JAGADEESH KUMAR
 
Intelligent Detection of Glaucoma Using Ballistic Optical Imaging
DR.P.S.JAGADEESH KUMAR
 
Robotic Simulation of Human Brain Using Convolutional Deep Belief Networks
DR.P.S.JAGADEESH KUMAR
 
Panchromatic and Multispectral Remote Sensing Image Fusion Using Machine Lear...
DR.P.S.JAGADEESH KUMAR
 
Classification and Evaluation of Macular Edema, Glaucoma and Alzheimer’s Dise...
DR.P.S.JAGADEESH KUMAR
 
Multilayer Perceptron Neural Network Based Immersive VR System for Cognitive ...
DR.P.S.JAGADEESH KUMAR
 
Computer Aided Therapeutic of Alzheimer’s Disease Eulogizing Pattern Classifi...
DR.P.S.JAGADEESH KUMAR
 
Congenital Bucolic and Farming Region Taxonomy Using Neural Networks for Remo...
DR.P.S.JAGADEESH KUMAR
 
Machine Learning based Retinal Therapeutic for Glaucoma
DR.P.S.JAGADEESH KUMAR
 
Fingerprint detection and face recognition for colonization control of fronti...
DR.P.S.JAGADEESH KUMAR
 
New Malicious Attacks on Mobile Banking Applications
DR.P.S.JAGADEESH KUMAR
 
Ad

Recently uploaded (20)

PPTX
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
PDF
Design Thinking basics for Engineers.pdf
CMR University
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
PPTX
Introduction to Design of Machine Elements
PradeepKumarS27
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
PDF
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
PPTX
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
PPTX
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
PPT
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
PDF
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PPTX
Depth First Search Algorithm in 🧠 DFS in Artificial Intelligence (AI)
rafeeqshaik212002
 
PDF
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
PPTX
Damage of stability of a ship and how its change .pptx
ehamadulhaque
 
PPTX
Arduino Based Gas Leakage Detector Project
CircuitDigest
 
DOCX
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
Design Thinking basics for Engineers.pdf
CMR University
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
Introduction to Design of Machine Elements
PradeepKumarS27
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
Lecture 1 Shell and Tube Heat exchanger-1.pptx
mailforillegalwork
 
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
MRRS Strength and Durability of Concrete
CivilMythili
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
Depth First Search Algorithm in 🧠 DFS in Artificial Intelligence (AI)
rafeeqshaik212002
 
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
Damage of stability of a ship and how its change .pptx
ehamadulhaque
 
Arduino Based Gas Leakage Detector Project
CircuitDigest
 
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 

Pioneering VDT Image Compression using Block Coding

  • 1. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 513 Pioneering VDT Image Compression using Block Coding P.S.Jagadeesh Kumar Associate Professor, Department of Compute Science University of Cambridge, United Kingdom Abstract: This paper patronage a unique block-built compression system for VDT images. The images are alienated to blocks, which are pigeon-holed into four dissimilar type blocks namely smooth blocks, text blocks, hybrid blocks and picture blocks with a dynamic block-built taxonomy system. Four dissimilar coding systems are castoff for respective block type bestowing to their disparate arithmetical chattels; to exploit their compression recital. Imitations demonstrate that the suggested exertion has subordinate intricacy than DjVu with substantially enhanced chromatic eminence at high bit rate, and it also outclasses the standard lossy image coding scheme JPEG. Keywords: Block-built Compression, Block Taxonomy, Coding algorithms, VDT images. 1 Introduction With the prevalent of digital diplomacies such as digital cameras, private PCs, ever more VDT images, comprehending text, graphics and natural images, are prevailing in digital systems such as screen images, web pages. The compassion of human eyes for natural image and text is not the same. The quality prerequisite of VDT image coding [1] is dissimilar from overall image coding since users cannot assent the quality if text is not vibrant enough to identify. How to compress VDT image is a stiff delinquent. A plenty of algorithms have been premeditated to compress images with dissimilar types. The Lempel-Ziv algorithm [2] is deliberate in compressing pure text images, which only have text on the pure color background in the entire image. The JPEG algorithm [3] is decorous for pure picture images which do not have any text in the entire images, but has debauched enactment on pure text images. Quite a few algorithms were also suggested to compress VDT images. One type of maneuvers for VDT images is layered coding. Utmost layered coding algorithms use the typical three layer mixed raster content exemplification. One prevalent method is DjVu, which practices a wavelet- based codec (IW44) for background and foreground, and JB2 for mask layer. Conversely, the intricacy of mask generating and IW44 wavelet transform is great, which makes DjVu not fit for real time application. Moreover, DjVu shows debauched concert on pure text images [4,5,6]. Block-built methodologies for VDT images are also willful for their low intricacy. Said et al. anticipated a simple blocked-based scheme, which compresses text blocks using JPEG-LS, picture blocks using JPEG [7]. Conversely, it nosedives to knob the hybrid blocks, which comprises mixed text and pictures. In text area, there are sturdy edges which cannot finger commendably by DCT centered coding such as JPEG. In this paper, extant an unusual system, which can progressively compress images with diverse content types, such as pure text images, pure picture images, and VDT images. Four coding algorithms are premeditated for those blocks with disparate types based on a dissolute block taxonomy method. 2 Block Coding System 2.1 Block-built compression The VDT image is foremost divided into 16x16 blocks. The 16x16 block dissection is done in-order to upturn the exactitude of segmentation as well as to expedite superior taxonomy of blocks. Further 32x32 blocks may upsurge the intricacy of block taxonomy. At that time, blocks are categorized into four types: smooth, text, hybrid and picture according to their different arithmetic physiognomies. Blocks of diverse type will be compressed with altered algorithms. The block type map is compressed using an arithmetic coder.
  • 2. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 514 2.2 Block taxonomy Algorithm The block taxonomy algorithm is habitually a reckoning arduous job. Here proposed a nippy and operative taxonomy algorithm based on two skins: histogram and ascent of the block. The pixels of per capita block are primarily congregated into three classes: low-ascent pixels, mid-ascent pixels and high-ascent pixels according to pixel’s ascent value. Then the histogram dispersal of per capita pixel group is figured. The emblematic ascent-histogram dissemination is shown in Fig.1. As witnessed in Fig.1, blocks of dissimilar type spectacles unlike ascent-histogram distributions. At that moment categorized the blocks into four types: smooth, text, hybrid and picture blocks based on ascent-histogram dissemination. The smooth blocks archetypally comprehend low-ascent pixels and show one peak at the low-ascent histogram. The text blocks constantly show several peaks at the low-ascent and high-ascent histograms. Merely a little mid-ascent pixels can be observed in text blocks. If the block comprehends large numbers of high-ascent pixels and mid-ascent pixels, it will be recognized as hybrid block. The blocks mostly consisting of mid-ascent pixels are blatant as picture blocks. Here T1~T7 are thresholds for arbitrating the block types. The low, mid and high ascent pixel values as well as threshold values are kept undisclosed for intellect reasons due to their inconsistent nature and are prevailed in range for ease. Fig. 1. The three histograms of text block (left), picture block (mid) and hybrid block (right) The blocks of dissimilar type are discrete in general and have dissimilar information disseminations. Smooth blocks are plane and subjugated by one benevolent of color. Text blocks are denser in spatial domain than in DCT domain. The influence of picture blocks is mainly strenuous on low frequency coefficients when they are DCT transmuted. Hybrid blocks, covering mixed text and picture images, cannot be efficiently epitomized both in spatial and frequency domain. Four coding algorithms are prudently intended to compress blocks of dissimilar types effectually. 2.3 Smooth Block Coding The coding of smooth blocks is candid. Smooth blocks are controlled by one color and their gray level range is restricted to the assumed threshold. Every colors in smooth blocks are quantized to the utmost recurrent color, which is veiled by an arithmetic coder. 2.4 Text Block Coding The text blocks are archetypally subjugated by numerous chief colors. The colors with incidence above the assumed threshold are preferred as chief colors. If close by over and above four colors sustaining beyond obligation, solitary the first four colors with prime number in luminance histogram will be preferred as chief colors. The colors adjacent to chief colors inside the assumed threshold are quantized to their conforming chief colors. The color quantization algorithm can be described as shown in Fig.2.
  • 3. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 515 Fig. 2. Color Quantization Algorithm Each pixel’s color in the text block is initially transformed to color index. The chief colors are indexed by 0, 1, 2 and 3 correspondingly. All the other colors are rehabilitated to index 4. The chief colors in each block are chronicled. Fig. 3. Text block coding contexts The text block index is skimmed and compressed in a raster scanning order and the present pixel index is coded based on its causal neighbors as shown in Fig.3 to endeavor the spatial significance in order to advance coding efficacy. “X” is present pixel to be implied. Each neighbor may be five different index values; there are total 5 125 3 contexts for coding codin the present pixel X. The present pixel index is coded by an arithmetic coder using the context specified by its three casual neighbors [NW, N, and W]. If the present pixel index is 4, the pixel value is also coded using an arithmetic coder. Fig. 4. Text Block Coding 2.5 Hybrid Block Coding Hybrid blocks comprise variegated text and pictures. There are robust high frequency signals owing to text edges. DCT transform is only operative to condense the vigor of low frequency signals, so the vigor in DCT domain of hybrid block is very dissimilar and inflexible to code. When Hybrid blocks are compressed with DCT block transform based coding such as JPEG, it will agonize from ringing effects about the text due to huge quantization step for those high frequency constituents. Wavelet-based systems such as JPEG-2K nosedive to compress hybrid blocks effectually.
  • 4. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 516 Whereas hybrid blocks are compressed with document image algorithms, the coding recital is too squat to be satisfactory. One elucidation to this delinquent is layered coding such as DjVu. The text and pictures are detached into dissimilar layers and self-reliantly coded. Proposed a haar wavelet based coding algorithm for hybrid blocks. As we know, short wavelet bases are obliging to moderate the ringing effect around text (edge), and longer bases are virtuous to advance the coding recital of the picture images. As a quid pro quo between two chucks, haar wavelet is designated to code hybrid blocks. Though haar wavelet’s poor recital on pure picture images, it can meritoriously confiscate the ringing effect on text images. Its coding recital outclasses other coding algorithms both in PSNR and visual quality. The hybrid block is first malformed with haar wavelet. Now that only one level haar wavelet transform is castoff, subsequently multilevel haar wavelet transform will produce long wavelet bases not appropriate for hybrid blocks. The wavelet coefficients are then coded by a simple arithmetic coder. The coefficients of dissimilar sub bands are veiled using unlike contexts. The simple haar wavelet algorithm can suggestively advance the visual quality and PSNR of images with hybrid blocks. The visual quality of rebuilt images of three methods is equated in Fig.5: coded by one level haar wavelet, three levels haar wavelet and DCT. The PSNR-bitrate curves of the three methods are shown in Fig.6. It is palpable that one level haar wavelet attains the finest coding recital for hybrids blocks. Fig. 5. Visual quality comparisons of reconstructed images Fig. 6. PSNR-bitrate curves for hybrid blocks of three methods: haar one level, haar three level and DCT. 2.6 Picture Block Coding JPEG has been evidenced to be an effective low complexity algorithm to compress picture images. Here proposed a JPEG-like algorithm to compress the picture blocks. The dissimilarity is that it is needed to gambol the blocks of other types. As expected, the system achieves analogous coding recital to jpeg on pure photographic images with trivial overhead of blocks type map.
  • 5. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 517 3 Experiment results Five images are verified in our tryouts: VDT1, VDT2, VDT3, picture1 and text1, which are revealed in Fig.7. VDT1~VDT3 are instances of typical VDT images with mixed text and pictures; Text1 is a document image; Picture1 is the sample of pure picture images. The coding efficacy between the proposed system and JPEG are equated in Fig.8 for three of the VDT images. The proposed system undoubtedly outclasses jpeg for all the images. In the pre-eminent circumstance, the proposed system outperforms JPEG up to 7db for VDT2, which comprises a ration of hybrid blocks. Fig.9 relates the visual quality of rebuilt images coded by JPEG, DjVu, and the proposed system at the similar bitrate. It is apparent that the proposed system accomplishes much improved visual quality than JPEG and also outclasses DjVu. The ringing affects round text in text blocks and hybrid blocks are efficaciously detached by the proposed system. Fig. 7. Test images: (a) VDT1, (b) VDT2, (c) VDT3, (d) Picuture1 and (e) Text1 Fig. 8. PSNR-bitrate curves for VDT1, VDT2 and VDT3 of BB vs. JPEG
  • 6. IEE Proceedings - Vision, Image and Signal Processing, 152 (4), August 2005, pp. 513-518 518 Fig. 9. Visual quality comparisons of portion of reconstructed images: Top (VDT2) and Bottom (VDT3). The proposed system is of low complexity since the algorithms castoff has less time complexity. The taxonomy algorithm just customs the histogram of a block; the hybrid coder practices haar wavelet; the text coder practices context-based arithmetic coding; the most reckoning lavish part of scheme is picture coding, which use a JPEG-like algorithm. In the vilest, the proposed system has analogous intricacy with JPEG. 4 Conclusion In this exertion, a Block-built compression system for VDT image coding is acquiesced. A profligate block taxonomy algorithm is suggested and four compression algorithms are prudently deliberated for blocks of diverse kinds. The proposed system accomplishes analogous or improved coding recital, on pure text images and pure picture images, to LZW and JPEG respectively. The system outclasses DjVu on VDT images at high bitrate. References [1] “Coding of still images”, ITU-T, ISO/IEC SG 8 FCD 15444-6, 2004. [2] J.Ziv and A. Lempel, “A universal algorithm for data compression”, IEEE Transaction on Information Theory, IT-23(3), pp.337-343, May 2003. [3] W. P. Pennebaker, J. L. Mitchell, “JPEG: Still image compression standard”, Van Nostrand Reinhold, 2004. [4] “Mixed Raster Content (MRC)”, ITU-T Recommendation T.44, Study Group-8 Contributions, 2001. [5] P. Haffner, L. Bottou, P.G. Howard, P. Simard, Y. Bengio, Y. LeCun, “High Quality document image compression with DjVu”, Journal of Electronic Imaging, pp.410- 425, July, 1998. [6] B.-f Wu, C.-C Chiu and Y. –L Chen, “Algorithms for compressing compound document images with large text/background overlap”, IEEE Proceedings on visual image signal process, Vol. 151 No. 6, pp.453-459 December 1997. [7] A.Said and A.Drukarev, “Simplified segmentation for compound image compression”, Proceeding of ICIP’ 2004, pp.229-233. BIOGRAPHY P.S.Jagadesh Kumar is working as Associate Professor in the Department of Computer Science, University of Cambridge, United Kingdom. He received his B.E degree from University of Madras in EEE discipline by the year 1999. He obtained his M.E degree in 2004 with specialization in CSE from Annamalai University, Chidambaram, India.