Fractal coding based on image local fractal dimension

Conci, Aura; Aquino, Felipe R.

Services on Demand

Article

Indicators

Cited by SciELO
Access statistics

Computational & Applied Mathematics

On-line version ISSN 1807-0302

Comput. Appl. Math. vol.24 no.1 Petrópolis Jan./Apr. 2005

Fractal coding based on image local fractal dimension

Aura Conci; Felipe R. Aquino

IC - Computer Institute, Department of Computer Science, UFF - Federal Fluminense University, r. Passo da Patria 156, 24210-240 Niteroi, RJ, Brazil, E-mails: aconci@ic.uff.br / felipe@lps.ufrj.br

ABSTRACT

Fractal codification of images is based on self-similar and self-affine sets. The codification process consists of construction of an operator which will represent the image to be encoded. If a complicated picture can be represented by an operator then it will be transmitted or stored very efficiently. Clearly, this has many applications on data compression. The great disadvantage of the automatic form of fractal compression is its encoding time. Most of the time spent in construction of such operator is due on finding the best match between parts of the image to be encoded. However, since the conception of automatic fractal image compression, researches on improvement of the compression time are widespread. This work aims to provide a new idea for decrease the encoding time: a classification of image parts based on their local fractal dimension. The idea is implemented on two steps. First, a preprocessing analysis of the image identify the complexity of each image block computing its dimension. Then, only parts within the same range of complexity are used for testing the better self-affine pairs, reducing the compression time. The performance of this proposition, is compared with others fractal image compression methods. The points considered are image fidelity, encoding time and amount of compression on the image file.
Mathematical subject classification: Primary: 68P30; Secondary: 94A08, 68U10, 68U05, 68P20, 65D15.

Key words: image coding system, fractal compression, image compression, PIFS codes, fractal dimension.

1 Introduction

Nowadays, the hardware to capture images turns possible the use of digital images with great tonal and spatial resolution. Image files now have a tendency to become too large if these possibilities must be used for improve the information quality (for example in biomedical or satellite applications). Moreover, the relatively low price of digital cameras and multimedia systems associated with the growth of the image databases and the intense use of nets enhance the use of image files.

A major objective of image coding is to represent digital images with as few bits as possible while preserving the level of intelligibility, usability or quality required for the application. Image compression has two great purposes: reduction of time for image transmission and reduction of storage area. The level of information quality required vary widely, depending on the special tasks. Several are the forms used to compact data, many of them can be used to compress images if after compression and reconstruction it must be exactly equal to the original [16]. Image coding techniques which do not destroy any information and which allow exact reconstruction of the original digital data are said to be information-preserving (or lost free). In such applications as image analysis from space programs and medical diagnosis, exactly regeneration of the original data is expected.

However, in applications such cellular phone image transmission, image retrieval in database or advertising on the WWW, some information in the original image may be destroyed without sacrifice the intelligibility to human viewer [14]. The more quality it is possible to sacrifice, the lower will be the required bit rate for transmission or storage. Image compression techniques which do not aim exact reconstruction of the original digital data are said to be lossy techniques [16]. These have been developed to exploit both the inherent characteristics of images and properties of human vision.

Image data are spatially correlated and preset local statistical self-dependence. Much redundancy and noise exist on picture material. So it is possible to reduce the image size considering the removal of these redundancy. It is more central to the human visual system to perceive and interpret, in a sense, the scene then really see the image information. For example only the outlines of an object (like a tree) are important for its identification. How humans can associate few draw elements with images is not clear and new researches have appeared in this direction. But this can be exploited in image compression improving the compression ratio.

Fractal compression employs the above mentioned aspect. It is based on self-similar and self-affine sets. The codification process consists of the construction of an operator which will represent the image to be encoded [1]. If a complicated image can be represented by an operator then the picture will be transmitted or stored very efficiently. The great disadvantage of the automatic form of fractal compression is its encoding time [2]. Most of the time spent in construction of such operator is due on finding the best match between parts of the image to be encoded (Figure 1). However, since the conception of automatic fractal image compression [11], researches on improvement of the compression time are widespread [7].

This work aims to provide a new idea for decrease the encoding time: a classification of image parts based on their local fractal dimension. The idea is implemented on two steps. First, a preprocessing analysis of the image identify the complexity of each image block computing its dimension. Then, only parts within the same range of complexity are used for testing the better self-affine pairs, reducing the compression time. The performance of this proposition, is compared with others fractal image compression methods. The points considered in this comparison are image fidelity, encoding time and amount of compression on the image file.

2 Encoding using FD for clustering

The fractal coding of an image f(N × N) , can be seen as an optimization problem [1]. To modeling this problem let E be the space of gray scale images. Then, we choose a space of functions f : X G, where the set X is taken as the set of spatial coordinates of the image and G represents the set of intensity values of this image. Also, let d be a metric, such that (E, d) is a complete metric space. Now the optimization problem is find a contractive mapping T on (E, d) whose fixed point f = T(f) exist and is unique (by the contractive mapping fixed point theorem) [2]. A detailed description of basic aspect of this theory and implementations of automatic approaches can be found in [5].

The decoding process consists of iterating the mapping T from any initial image until the iterates converge to the atractor - an approximation of the original image [1]. For complex images T is not a single function but a system known as Iterated Function System - IFS [2]. An extension for fully automatic compression programs is the Partitioned Iterated Function System - PIFS [11]. PIFS basic idea is simplify the problem of compress the whole image by partitioning the image into blocks and then seek self-similarity between larger parts and smaller parts. The nature of the PIFS is illustrades in figure 1. The encoding process consists of the construction of the operator T, which is a system of affine transformation, each one defined by matching the better couple of sets, formed from the original image [2].

Several researchers have taken up the challenge to improve the basic automated algorithm for compression [7, 8, 15]. Some of the main subjects addressed are compression ratio, time reduction and image fidelity [3, 12]. We show, in the folowing that, the time of compression can be improved without lost of the image fidelity considering the contractive factor and the local image complexity. Depending on the block complexity, different contractive factors have been used to form two groups on the image partitioned for compression [11].

Fractal dimensions (FD) have been used to obtain shape information and distinguish between smooth (small FD) and sharp (large FD) regions [6, 13, 14]. In this work we propose it to characterize local complexity in images. In other words, the partitioned image is preprocessed and each local FD is computed to generate separate pools depending on local image complexity. To derive the contractive transformation only the relation between elements on pools of same FD are considered (improving compression time and quality of reconstruction).

Let Ri and Dj be two subsets on f (the first called range is formed from partitioning n × n non-overlapping regions on X and the second called domain is formed also by subset of X but witch may overlap). Let t_i : Dj Ri be a contraction defined by matching the better t_i (Dj) for each Ri, which means that t_i is chosen such that the distance d (Ri, t_i(Dj)) is as small as possible (figure 2).

The operator T is given by

considering the contractive factor s of the Collage theorem [1] and fixed point A (attractor). Then

that is the distance between the original image, f, and its reconstruction, A, the attractor of the PIFS, has (by the same theorem) an upper bound that is a function of the contractive factor s of T.

Several papers have popularized the scheme where a digital image is partitioned into square range blocks (say n × n pixels) and larger square domain blocks, frequently twice the size of the range blocks [8]. This scheme makes the compression process simple, but it uses the contractive factor of T equal to s = . So by the Collage theorem [1, 2]:

We use s = or s = depending on the cluster - LFD. If the contraction of T is reduced to s = , or then, by the theorem, the upper bound to d(f, A) will decrease to:

and it is expected that the attractor, A, should look (each time more) quite like f, with these new set of contractions. This is the main idea of our work: the utilization of variable contractive factor, based on image complexity values.

Let D be a collection of subsets of from which D_j are chosen. D consistes of (m × m) pixels. The number of elements on D determines how much computation time the encoding takes on finding t_i(D_j) for each R_i. For encoding, the mapping t_i must be specified and the better domain squares D_j must be chosen from a set of potential candidates. The choice of the domain pool as 4n × 4n or 8n × 8n for each n × n range block (figures 3 and 4), reduces the computation required for the search and minimizes image quality reduction [2].

We use local fractal dimension (represented by LFD) to subdivide the blocks into classes of complexity (see figure 5). Some methods on LFD estimation do not give satisfactory results in all possible range for images (from 2 to 3). For fast LFD evaluation a simple and efficient algorithm has been used [6]. We use two levels of complexity, based on the LFD of each image part: 2 < LFD < 2.5 and 2.5 < LFD < 3.

On finding each t_i, the domain sets D_j must be chosen from a set of potential candidates, i.e. a set with the same LFD of the range sets, R_i. Although, real images present semi-fractal behavior, they are considered in this analysis ideal fractal models. Of course it must be considered when one is addressing to the problem of finding the real fractal dimension of the image, but here it is only a way for representing its local complexity [13].

3 Experimental results

Table 1 presents the results of this proposition on compression and reconstruction of 13 images. The efficiency of this image compression approach is quantified by considering both image fidelity and time reduction.

In this table, for no hardware dependece and installation interference on the results, the parameter related to the time spending on encoding is relative to the average results obtained on the same 13 images using the classical approach [2] (see table 2). Image fidelity is evaluated using 3 measures: the root mean square error, e_rms, the signal to noise ratio, SNR and the peak signal to noise ratio, PSNR of the each image. These are defined as:

where f(x, y) is the gray level on the original image, A(x, y) is the gray level on the reconstructed image (attractor), N is the number of pixels in each image direcion and p is the number of bits per pixel used for definition of the gray level (128 pixels and eight bits deep in the tested images).

Figure 6 compares image quality on reconstruction of six images from the set of 13 analyzed: Lena, LAX, Cameraman, Columbia, Goldhill and Couple. All results obtained with these 13 pictures can be seen in http://www.ic.uff.br/felipe/imgtest.html.

For comparison the same set is compressed by two other methods: the Heavy Brute Force Algorithm or Exhaustive Approach [2, 11] (table 2 and figure 7) and Restricted Area Search [12] (table 3 and figure 8). Figures 6-8 show also the result of subtraction: e(x, y) = (f (x, y) A(x, y)) , where f is the original image and A its reconstruction (third and fourth columns). On third column, for better verification the absolute value of these subtractions are amplify by a scale factor of 5. On fourth column, an offset of 127 (the middle of the gray level) is used to turn possible visualization of negative or positive values.

Considering now the compression ratio. All images have 128 × 128 pixels (=16384 pixels) and 256 gray level. They need 131072 bits or 16384 bytes and 8 bpp (bits per pixel) before compression.

In the Heavy Brute Force algorithm or Exhaustive Aproach [2, 11], the domain blocks are always square and always twice the size of range blocks. The range blocks are 4 × 4 pixels and the domain blocks are 8 × 8 pixels. The number of block to be coded is (128/4)²=1024. Each transformation can be coded using 23 bits (6 for locate de domain block in horizontal direction, plus 6 for locate the domain block in vertical direction, 3 bits for identification of the 8 symmetry and 8 bits to define the offset on intensity), then each compressed image can be stored as an text file of 23552 bits or 2944 bytes. On both, each image is represented by a file with 23552 bits. The compression ration for all images is = 1.4375 bpp (bits per pixel).

In the proposed approach, one bit more is needed for represent what will be the cluster of the block. Then each block needs 24 bits for its storage as an text file, and each image can be represented as 1024 * 24 = 24576 bits or 3072 bytes, in the more unfavorable case. So the worse compression ratio, using the proposed approach is = 1.5 bpp (bits per pixel).

The Restricted Area Search [12] approach considers a reasonable way of reducing the encoding time to restrict the search to nearby areas. For example, the source image may be sectioned into four quadrants (figure 9). For range block in the bottom left quadrant, say, only domain blocks in that same quadrat are search. As a result, if the better pair is not in the same quadrant a poor quality of compression/reconstruction can be obtained (see and compare the Cameraman image before and after reconstruction using Resitricted Area Search on third line of figure 8).

4 Conclusions

The principal goal in fractal compression has been maximum speed and quality. While still an open challenge, the presented approach is a significant advance in this direction. It is, on average, 15 time faster then early attempts (the Restricted Area Search is 4 times faster the classical approach). Moreover, when consider visual quality, the Error rms, the Signal to Noise Ratio rms and the Peak Signal to Noise Ratio there is not great difference between this and the classical approach. And it is clearly better than Restricted Area Search. The proposed aproach presents also no relevant increase on compression ratio.

In http://www.ic.uff.br/~aconci/compressao/fractal.html or http://www.ic.uff.br/~felipe/mdfl.html several methodologies suggested in other works (Light Brute Force, Local Sub Search, Local Search, Look Same Place Search, Double Canonic Classification, Canonic Classification through the Intensities and Double Canonic Classification with Quadtree of 3 levels) are compared with the here presented technique. Color images have also be tested (http://www.ic.uff.br/~felipe/imgcolor.html).

Acknowledgment

This work has been supported in part by the projects Protem-CC QualiIn and CNPq (Ref. 3024369/2004-9). The authors acknowledge the VisualLab (CTC-UFF) for the programming support.

REFERENCES

[1] M.F. Barnsley, Fractals Everywhere. 2nd ed., Academic Press, New York, (1993). [ Links ]

[2] M.F. Barnsley and L. Hurd, Fractal Image Compression. AK Peters, Wellesley, (1993). [ Links ]

[3] Chwen-Jye Sze, Hong-Yuan Mark Liao, Kuo-Chin Fan, Ming-Yang Chern and Chen-Kou Tsaoy, Fractal image coding system based on an adaptive side-coupling quadtree structures. Image and Vision Computing, 14 (1996), 401-415 [ Links ]

[4] A. Conci and F.R. Aquino, Using Adaptive Contraction for Fractal Image Coding Basedin Local Fractal Dimension. Proceedings of SIBGRAPI: the Brazilian Symposium on Computer Graphics, Image Processing and Vision, IEEE Computer Science, Campinas, SP, 231-239 (1999). [ Links ]

[5] A. Conci and F.R. Aquino, Codificação Fractal de Imagens. Proceedings of DINCON: I Congresso de Dinâmica, Controle e Aplicações, cap. 16, 365-402. Ed. by J.M. Balthazar, M. Boaventura, G.N. Silva e M. Tsuchida. SBMAC - ISBN-85-86883-05-0, São José do Rio Preto, SP (2002). [ Links ]

[6] A. Conci and C.F.J. Campos, An Efficient Box-Counting Fractal Dimension Approach for Experimental Image Variation Characterization. Proceedings of IWSIP'96 - 3rd International Workshop in Signal and Image Processing, Elsevier Science, Manchester, UK, 665-668 (1996). [ Links ]

[7] Y. Fisher, Fractal Compression: Theory and Application to Digital Images. Springer Verlag, (1994). [ Links ]

[8] J.C. Hart, Fractal Image Compression and Recurrent Iterated Function Systems. IEEE Computer Graphics and Applications, 25-40 (1996). [ Links ]

[9] R. Hamzaoui and D. Saupe, Combining Fractal Image Compression and Vector Quantization. IEEE Transactions on Image Processing. 9 (2000), 197-208. [ Links ]

[10] R. Hamzaoui, H. Hartenstein and D. Saupe, Local Iterative Improvement of Fractal Image Codes. Image and Vision Computing. 18 (2000), 565-568. [ Links ]

[11] A. Jacquin, Image Coding Based on a Fractal Theory of Iterated Contractive Image Transformations. IEEE Transactions on Image Processing. 1 (1992), 18-30. [ Links ]

[12] J. Kominek, Advances in Fractal Compression for Multimedia Applications. [ftp://links.uwaterloo.ca:/pub/Fractals/Papers/Waterloo/kominek95c.ps.gz] University of Waterloo, Internal report CS95-28 (1995). [ Links ]

[13] S. Morgan and A. Bouridane, Application of Shape Recognition to Fractal Based Image Compression. Proceedings of IWSIP'96-3rd International Workshop in Signal and Image Processing, Elsevier Science, Manchester, UK, 23-26 (1996). [ Links ]

[14] M. Nappi, G. Polese and G. Tortora, FIRST: Fractal Indexing and Retrieval SysTem for Image Databases. Image and Vision Computing. 16 (1998), 1019-1031. [ Links ]

[15] D. Saupe and R. Hamzaoui, A Review of Fractal Image Compression Literature. Computer Graphics, 28 (1994), 268-275. [ Links ]

[16] A. Watt and F. Policarpo, The Computer Image. ACM SigGraph series, Addison-Wesley, Essex, UK. (1998). [ Links ]

Received:05/I/04 . Accepted:09/XI/04

#596/04.