State-of-the-Art Trends in Data Compression: COMPROMISE Case Study
Figure 1. The COMPROMISE feature-based data compression encoding/decoding concept.
Figure 2. A variant of the COMPROMISE concept with lossy compression replacing feature extraction.
Figure 3. A draft of a compressive-sensing variant of the COMPROMISE concept.
Figure 4. Interval features with linear approximation between two successive distinct extremes in a 1D (digital audio) example.
Figure 5. An interval feature with the grid-based mask polyline approximation between two successive distinct extremes in a 1D (digital audio) example.
Figure 6. Compressing digital audio with the COMPROMISE variant of replacing feature extraction with lossy compression.
Abstract
1. Introduction
- An overview of the challenges driving the recent progress in data compression.
- A critical overview of trends in data compression and how they relate to specific challenges.
- Possibly the first proposal for a formal definition of near-lossless compression, a notion that has occasionally been addressed in the literature in different contexts but has not been clearly specified.
- The introduction of the COMPROMISE paradigm and, within it, the following innovations:
  - Generality, achieved through a unified taxonomy of (generic) features to which their domain-specific descriptions can be linked.
  - Universality, in the form of a uniform concept that allows lossy, lossless, and near-lossless compression within a common framework.
  - The use of interpolation of intermediate values, and not only extrapolation of upcoming values. Dynamic programming and artificial intelligence techniques can be used as an alternative to greedy methods to optimize predictions. Another advantage of feature-based prediction is that the interpolation parameters are stored losslessly in the feature description, which prevents the accumulation of decoding errors. With extrapolation, in contrast, a lossy decoded sample is an input to the prediction of the next sample, so errors accumulate.
  - Presenting information in a way that is more suitable for human interpretation or machine processing. Access to semantic features is often easier from a compressed stream than from raw data, which significantly supports reusability.
  - Finally, the most innovative outcome is certainly the design that integrates all these partial solutions into a single common methodology. All the trends identified in the paper are highly interdependent and require a holistic approach to integrate them into new paradigms. To the best of our knowledge, a data compression method with such an ambitious set of objectives does not yet exist. Feature-based compression and domain independence seem particularly difficult to reconcile.
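The contrast between interpolation and extrapolation drawn in the list above can be illustrated with a small Python sketch. It is only a toy under stated assumptions, not the COMPROMISE implementation: the function names, the uniform quantizer, and the previous-sample predictor are hypothetical, and the extrapolating coder is assumed to be open-loop (residuals are formed against the original samples, while the decoder can only predict from its own lossy output).

```python
import math

def quantize(value, step):
    """Uniform quantizer: nearest multiple of `step` (ties round up)."""
    return step * math.floor(value / step + 0.5)

def decode_extrapolated(samples, step):
    """Previous-sample prediction with quantized residuals (assumed open-loop).

    Each residual is measured against the ORIGINAL predecessor, but the
    decoder adds it to its own lossy predecessor, so errors add up.
    """
    decoded = [samples[0]]  # the first sample is assumed to be stored losslessly
    for i in range(1, len(samples)):
        residual = quantize(samples[i] - samples[i - 1], step)
        decoded.append(decoded[-1] + residual)
    return decoded

def decode_interpolated(samples, step):
    """Linear interpolation between two losslessly stored end points.

    Every prediction depends only on the end points, never on another
    lossy decoded sample, so the error of each sample stays independent.
    """
    first, last, n = samples[0], samples[-1], len(samples) - 1
    decoded = []
    for i, sample in enumerate(samples):
        prediction = first + (last - first) * i / n
        decoded.append(prediction + quantize(sample - prediction, step))
    return decoded

def max_abs_error(original, decoded):
    return max(abs(o - d) for o, d in zip(original, decoded))

samples = [0, 3, 5, 8, 10, 13, 15, 18, 20]  # made-up test signal
step = 4                                     # made-up quantization step
print("extrapolation:", max_abs_error(samples, decode_extrapolated(samples, step)))
print("interpolation:", max_abs_error(samples, decode_interpolated(samples, step)))
```

In this toy setting, the interpolated reconstruction stays within half a quantization step of the original, while the extrapolated one drifts further from it with every decoded sample.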
2. Actual Trends in Data Compression
2.1. Feature-Based Data Compression
2.2. Unified Concept of Lossless, Near-Lossless and Lossy Compression
2.3. Data Restoration
2.4. Data Compression and Machine Learning
2.5. Domain-Independent Data Compression
3. COMPROMISE
- To develop a general and universal data compression methodology with a unified taxonomy of features from diverse domains, and a common framework for lossless, near-lossless, and lossy compression.
- To upgrade the predictions of original data by integrating the techniques of feature selection and data restoration.
- To improve the compression ratios in lossless and near-lossless modes in comparison with the existing approaches.
- To improve the accessibility and reusability of features. Access to semantic features is often easier from a compressed stream than from raw data and feature-based restoration.
- To verify the paradigm in four pilot domains: raster images, digital audio, biomedical signals, and sparse voxel grids. During the project, the research has been extended to complete voxel grids, point clouds, and triangulated surfaces (static, dynamic, and time-varying). COMPROMISE’s results to date have been published in the following journal articles: refs. [13,79] on general issues (entropy coding and feature selection, respectively), refs. [80,81,82] on images, and ref. [83] on meshes.
3.1. COMPROMISE Concept
3.2. Unified Taxonomy of Generalized Features
- Integers in the positional notation are highly convenient for compression. After the residuals are transformed to non-negative values, they often have multiple leading zeros, which may simply be omitted.
- Floating point numbers in the positional notation can be considered in the same way as integers. However, the scientific notation, e.g., IEEE 754, appears to be slightly more complex. Here, the replacement of an input value by a residual potentially lowers the exponent and shortens the mantissa by shifting its non-zero digits to the left.
- Multiple attributes, e.g., RGB color components or a pair of stereo audio samples, may also be attached to a single data stream sample or, alternatively, streams of individual components may be considered as separate data streams.
- Samples without values are also possible. This makes sense if we are interested in a geometric shape only, i.e., the samples that do represent the region of interest. This corresponds to the complete-grid data stream of Boolean values 0 and 1, where the non-shape samples (e.g., 0-values) are omitted.
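The remark on integer residuals in the list above can be sketched in a few lines of Python. The source does not say which transformation to non-negative values is used; the interleaving ("zigzag") mapping below is an assumption, chosen only because it is a common option, and the helper names and sample values are hypothetical.

```python
def to_non_negative(residual: int) -> int:
    """Interleave signed residuals: 0, -1, 1, -2, 2 -> 0, 1, 2, 3, 4."""
    return 2 * residual if residual >= 0 else -2 * residual - 1

def from_non_negative(code: int) -> int:
    """Inverse of to_non_negative."""
    return code // 2 if code % 2 == 0 else -(code + 1) // 2

def significant_bits(value: int) -> int:
    """Bits left after the leading zeros are dropped (at least one bit)."""
    return max(1, value.bit_length())

samples = [1000, 1002, 1001, 1001, 1004]   # made-up integer samples
# Previous-sample prediction, used here only to obtain small residuals.
residuals = [b - a for a, b in zip(samples, samples[1:])]
codes = [to_non_negative(r) for r in residuals]

print(residuals)                              # signed residuals
print(codes)                                  # non-negative codes
print([significant_bits(c) for c in codes])   # bits per code without leading zeros
print(significant_bits(max(samples)))         # bits needed for a raw sample
```

A real coder would still have to signal how many bits each code occupies (for example with an entropy coder or a length prefix); the sketch only shows why the mapped residuals are much shorter than the raw samples.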
- Features in F may overlap, while the features in do not.
- Each sample is addressed by exactly one feature.
- Domain-dependent patterns of samples in may be interrupted by gaps, and the patterns in and the corresponding patterns of residuals in R represent connected intervals.
- Samples in may be arranged into 1D, 2D, or 3D connected or sparse structures, while those in and R are always arranged into 1D intervals (lists, segments, or substreams).
- f.interpretation: definitions of the presence and structure of data in the other two components. These are actually instructions for the data restoration module of the decoder, enabling it to develop linear streams of features and residuals into the corresponding 1D, 2D, or 3D structures of the restored output stream . Furthermore, also stores the information on whether represents the feature with its boundary, the interior, key samples, or some combination of these. Finally, in the lossy and near-lossless mode, the information on entering the losses must be defined, i.e., the quantization and/or pattern subsampling parameters.
- : the sequence of samples from I affected by the feature. These samples might be coded directly in , in the corresponding part of the residuals’ stream R, or as a combination, where the former contains the feature control samples and the latter encodes the remaining ones.
- : unambiguous rules, which determine how the feature affects samples from .
  - The interpolation function interpolates key samples and/or values on the segment border/box. The goal is to find an interpolation for which the residuals of the estimated sample values are “optimal” (already having the lowest entropy, or being the best compressible towards the lowest entropy).
  - The approximation function approximates samples of a given segment in an “optimal” way (best-fitting curve/surface). Key samples and/or the segment border/box are used to define the control points to be fitted.
  - The extrapolation function predicts values of the observed pattern by using the values from some predefined neighboring pattern.
For simplicity, we will survey some of these functions in the audio example in Section 3.3, where the interpretation is the simplest due to the 1D nature of the signal.
3.3. Examples of Feature Interpretation in Digital Audio Compression
- Linear interpolation: the interval values are predicted with the line segment between the interval border values (see Figure 4).
- Average value approximation: all the values in the interval are predicted with the average value computed on the entire interval.
- Grid-based polyline interpolation (Figure 5): this is based on a so-called mask, a predefined pattern of a few sample locations in the considered interval, which aims to locally attract the graph of the prediction function. At each mask location, a limited repertoire of sample values is offered, and the one closest to the actual sample value is chosen for the calculation of the residual. As there are only a few mask locations and only a few predefined values at each of them, the mask can be represented compactly with only a few bits.
- Verbatim: lossless mode, where the interval values are explicitly listed.
- RLE: lossless mode, where the repeated value and the number of repetitions are given.
- Black-box-feature prediction: this is the implementation of the modified concept from Figure 2, where a lossy compressed file replaces the feature set.
- Data restoration: when an obvious trend exists in a sequence of the interval values, the feature may be represented with a few initial values and the last value, while the missing intermediate values can be restored. The example is explained in Section 2.3.
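The first two interpretations in the list above (linear interpolation and average value approximation) can be sketched as follows. This is a minimal illustration under assumptions, not the authors' codec: the function names and the sample signal are hypothetical, extremes are detected as strict local minima/maxima only (plateaus are ignored), and integer arithmetic is used so that an encoder and a decoder would compute identical predictions.

```python
def find_extremes(signal):
    """Indices of both end points and of all strict local minima/maxima."""
    indices = [0]
    for i in range(1, len(signal) - 1):
        rise_in = signal[i] - signal[i - 1]
        rise_out = signal[i + 1] - signal[i]
        if rise_in * rise_out < 0:      # the slope changes its sign
            indices.append(i)
    indices.append(len(signal) - 1)
    return indices

def linear_prediction(signal, start, end):
    """Line segment between the two interval border values."""
    a, b, n = signal[start], signal[end], end - start
    return [a + ((b - a) * k) // n for k in range(n + 1)]

def average_prediction(signal, start, end):
    """One (integer) average value for the entire interval."""
    interval = signal[start:end + 1]
    return [sum(interval) // len(interval)] * len(interval)

def residuals(signal, start, end, prediction):
    return [s - p for s, p in zip(signal[start:end + 1], prediction)]

signal = [0, 4, 9, 12, 7, 3, 1, 5, 10]   # made-up 1D samples
extremes = find_extremes(signal)
for start, end in zip(extremes, extremes[1:]):
    linear = residuals(signal, start, end, linear_prediction(signal, start, end))
    average = residuals(signal, start, end, average_prediction(signal, start, end))
    print((start, end), "linear:", linear, "average:", average)
```

On a signal that is close to piecewise linear between its extremes, as in this made-up example, the linear residuals are much smaller than the average-based ones, which is what makes them cheaper to entropy code. Choosing the interpretation per interval, as the list suggests, would amount to comparing such residual sets.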
4. Discussion and Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
AAC | Advanced Audio Coding |
ALS | Audio Lossless Coding |
APE | Monkey’s Audio lossless file format |
ARJ | Archived by Robert Jung |
ASCII | American Standard Code for Information Interchange |
BASC | Binary Adaptive Sequential Coding |
CNN | Convolutional Neural Network |
COMPROMISE | Data Compression Paradigm Based on Omitting Self-Evident Information |
CPU | Central Processing Unit |
DCT | Discrete Cosine Transform |
DEFLATE | lossless data compression combination of LZ77 and Huffman coding |
DNA | Deoxyribonucleic acid |
ECG | Electrocardiography |
EED | Edge-enhancing anisotropic diffusion |
EEG | Electroencephalography |
FLAC | Free Lossless Audio Codec |
GAN | Generative Adversarial Network |
HEVC | High Efficiency Video Coding |
HPC | High-Performance Computing |
ICT | Information-Communication Technology |
IEEE | Institute of Electrical and Electronics Engineers |
IoT | Internet of Things |
JPEG | Joint Photographic Experts Group and their lossy image compression |
JPEG 2000 | JPEG and their wavelet-based image compression algorithm |
JPEG-LS | JPEG Lossless image compression algorithm |
LZ77 | Lempel–Ziv universal lossless data compression algorithm from 1977 |
LZW | Lempel–Ziv–Welch universal lossless data compression algorithm |
MDCT | Modified Discrete Cosine Transform |
MGARD | MultiGrid Adaptive Reduction of Data |
MGCC | Multiple Grid Chain Code |
MP3 | MPEG-1 Audio Layer III or MPEG-2 Audio Layer III |
MPEG | Moving Picture Experts Group |
MwI | Move with Interleaving transform |
OGG | Ogg Vorbis lossy audio file format |
PDE | Partial Differential Equation |
PNG | Portable Network Graphics |
RAR | Roshal Archive |
R-EED | EED within a rectangular subdivision |
RGB | Red, Green, Blue color model |
RLE | Run-Length Encoding |
RNN | Recurrent Neural Network |
SD | Standard Deviation |
SZ | Lossy compressor framework for scientific data |
VAE | Variational Autoencoder |
ZB | Zettabyte |
ZFP | A very fast compressor and decompressor for floating-point data in HPC |
ZIP | Universal lossless archive file format developed by Phil Katz |
References
- Salomon, D.; Motta, G. Handbook of Data Compression; Springer Science & Business Media: London, UK, 2010.
- Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
- Huffman, D.A. A method for the construction of minimum-redundancy codes. Proc. Inst. Radio Eng. 1952, 40, 1098–1101.
- Vitter, J.S. Design and analysis of dynamic Huffman codes. J. ACM 1987, 34, 825–845.
- Langdon, G.G. An introduction to arithmetic coding. IBM J. Res. Dev. 1984, 28, 135–149.
- Ziv, J.; Lempel, A. A universal algorithm for sequential data compression. IEEE Trans. Inf. Theory 1977, 23, 337–343.
- Welch, T.A. A technique for high-performance data compression. Computer 1984, 17, 8–19.
- Moffat, A.; Stuiver, L. Binary interpolative coding for effective index compression. Inf. Retr. 2000, 3, 25–47.
- Žalik, B.; Mongus, D.; Lukač, N.; Žalik, K.R. Efficient chain code compression with interpolative coding. Inf. Sci. 2018, 439, 39–49.
- Burrows, M.; Wheeler, D.J. A Block-Sorting Lossless Data Compression Algorithm; Technical Report; Systems Research Center, Digital Equipment Corporation and Digital Systems Research Center: Palo Alto, CA, USA, 1994.
- Žalik, B.; Mongus, D.; Lukač, N.; Žalik, K.R. Can Burrows-Wheeler transform be replaced in chain code compression? Inf. Sci. 2020, 525, 109–118.
- Adjeroh, D.; Bell, T.; Mukherjee, A. The Burrows-Wheeler Transform: Data Compression, Suffix Arrays, and Pattern Matching; Springer: New York, NY, USA, 2008.
- Žalik, B.; Strnad, D.; Podgorelec, D.; Kolingerová, I.; Lukač, L.; Lukač, N.; Kolmanič, S.; Žalik, K.R.; Kohek, Š. A New Transformation Technique for Reducing Information Entropy: A Case Study on Greyscale Raster Images. Entropy 2023, 25, 1591.
- Roelofs, G. PNG: The Definitive Guide; O’Reilly & Associates, Inc.: Sebastopol, CA, USA, 1999.
- Weinberger, M.J.; Seroussi, G.; Sapiro, G. The LOCO-I lossless image compression algorithm: Principles and standardization into JPEG-LS. IEEE Trans. Image Process. 2000, 9, 1309–1324.
- Liebchen, T.; Reznik, Y.A. MPEG-4 ALS: An emerging standard for lossless audio coding. In Proceedings of the DCC 2004, Data Compression Conference, Snowbird, UT, USA, 23–25 March 2004; pp. 439–448.
- Monkey’s Audio—A Fast and Powerful Lossless Audio Compressor. Available online: https://www.monkeysaudio.com/ (accessed on 8 September 2024).
- Golomb, S. Run-length encodings (corresp.). IEEE Trans. Inf. Theory 1966, 12, 399–401.
- Rice, R.F. Some Practical Universal Noiseless Coding Techniques; Technical Report; Acronym NASA: Pasadena, CA, USA, 1979.
- Deutsch, P. DEFLATE Compressed Data Format Specification Version 1.3; Technical Report; RFC Editor: Marina del Rey, CA, USA, 1996.
- Pan, D.Y. Digital audio compression. Digit. Tech. J. 1993, 5, 28–40.
- Furht, B. A survey of multimedia compression techniques and standards. Part I: JPEG standard. Real-Time Imaging 1995, 1, 49–67.
- Wahid, K. Low complexity implementation of Daubechies wavelets for medical imaging applications. In Discrete Wavelet Transforms-Algorithms and Applications; IntechOpen Limited: London, UK, 2011.
- Taubman, D.S.; Marcellin, M.W.; Rabbani, M. JPEG2000: Image compression fundamentals, standards and practice. J. Electron. Imaging 2002, 11, 286–287.
- The European Data Strategy. Available online: https://ec.europa.eu/info/strategy/priorities-2019-2024/europe-fit-digital-age/european-data-strategy (accessed on 9 September 2024).
- Green Computing: Latest Practices and Technologies for Ict Sustainability. Available online: https://ictfootprint.eu/en/green-computing-latest-practices-and-technologies-ict-sustainability (accessed on 9 September 2024).
- Kohout, J. On digital image representation by the Delaunay triangulation. In Advances in Image and Video Technology, Proceedings of the Second Pacific Rim Symposium, PSIVT 2007, Santiago, Chile, 17–19 December 2007; Proceedings 2; Springer: Berlin/Heidelberg, Germany, 2007; pp. 826–840.
- Demaret, L.; Dyn, N.; Iske, A. Image compression by linear splines over adaptive triangulations. Signal Process. 2006, 86, 1604–1616.
- Žalik, B. An efficient sweep-line Delaunay triangulation algorithm. Comput. Aided Des. 2005, 37, 1027–1038.
- Freeman, H. On the encoding of arbitrary geometric configurations. IRE Trans. Electr. Comp. 1961, EC-10, 260–268.
- Nunes, P.; Marqués, F.; Pereira, F.; Gasull, A. A contour-based approach to binary shape coding using a multiple grid chain code. Signal Process. Image Commun. 2000, 15, 585–599.
- Liu, Y.K.; Žalik, B.; Wang, P.j.; Podgorelec, D. Directional difference chain codes with quasi-lossless compression and run-length encoding. Signal Process. Image Commun. 2012, 27, 973–984.
- Jeromel, A.; Žalik, B. An efficient lossy cartoon image compression method. Multimed. Tools Appl. 2020, 79, 433–451.
- Sriraam, N. A High-Performance Lossless Compression Scheme for EEG Signals Using Wavelet Transform and Neural Network Predictors. Int. J. Telemed. Appl. 2012, 2012, 302581.
- Angeline, M.; Suja Priyadharsini, S. Hybrid compression of biomedical ECG and EEG signals based on differential clustering and encoding techniques. Int. J. Imaging Syst. Technol. 2021, 31, 708–717.
- Martínez, L.A.; Bribiesca, E.; Guzmán, A. Chain coding representation of voxel-based objects with enclosing, edging and intersecting trees. Pattern Anal. Appl. 2017, 20, 825–844.
- Strnad, D.; Kohek, Š.; Nerat, A.; Žalik, B. Efficient representation of geometric tree models with level-of-detail using compressed 3D chain code. IEEE Trans. Vis. Comput. Graph. 2019, 26, 3177–3188.
- Bertalmio, M.; Sapiro, G.; Caselles, V.; Ballester, C. Image inpainting. In Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, New Orleans, LA, USA, 23–28 July 2000; pp. 417–424.
- Qureshi, M.A.; Deriche, M.; Beghdadi, A.; Amin, A. A critical survey of state-of-the-art image inpainting quality assessment metrics. J. Vis. Commun. Image Represent 2017, 49, 177–191.
- Wang, C.; Sun, X.; Wu, F.; Xiong, H. Image compression with structure-aware inpainting. In Proceedings of the 2006 IEEE International Symposium on Circuits and Systems (ISCAS), Island of Kos, Greece, 21–24 May 2006; p. 4.
- Liu, D.; Sun, X.; Wu, F.; Li, S.; Zhang, Y.Q. Image compression with edge-based inpainting. IEEE Trans. Circuits Syst. Video Technol. 2007, 17, 1273–1287.
- Shen, J.; Chan, T.F. Mathematical models for local nontexture inpaintings. SIAM J. Appl. Math. 2002, 62, 1019–1043.
- Chan, T.F.; Shen, J. Nontexture inpainting by curvature-driven diffusions. J. Vis. Commun. Image Represent 2001, 12, 436–449.
- Galić, I.; Weickert, J.; Welk, M.; Bruhn, A.; Belyaev, A.; Seidel, H.P. Towards PDE-based image compression. In International Workshop on Variational, Geometric, and Level Set Methods in Computer Vision, Proceedings of the Third International Workshop, VLSM 2005, Beijing, China, 16 October 2005; Springer: Berlin/Heidelberg, Germany, 2005; pp. 37–48.
- Galić, I.; Weickert, J.; Welk, M.; Bruhn, A.; Belyaev, A.; Seidel, H.P. Image compression with anisotropic diffusion. J. Math. Imaging Vis. 2008, 31, 255–269.
- Peter, P.; Hoffmann, S.; Nedwed, F.; Hoeltgen, L.; Weickert, J. From optimised inpainting with linear PDEs towards competitive image compression codecs. In Image and Video Technology, Proceedings of the 7th Pacific-Rim Symposium, PSIVT 2015, Auckland, New Zealand, 25–27 November 2015; Revised Selected Papers 7; Springer: Cham, Switzerland, 2016; pp. 63–74.
- Schmaltz, C.; Peter, P.; Mainberger, M.; Ebel, F.; Weickert, J.; Bruhn, A. Understanding, optimising, and extending data compression with anisotropic diffusion. Int. J. Comput. Vis. 2014, 108, 222–240.
- Peter, P.; Weickert, J. Colour image compression with anisotropic diffusion. In Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France, 27–30 October 2014; pp. 4822–4826.
- Hoeltgen, L.; Mainberger, M.; Hoffmann, S.; Weickert, J.; Tang, C.H.; Setzer, S.; Johannsen, D.; Neumann, F.; Doerr, B. Optimising spatial and tonal data for PDE-based inpainting. In Variational Methods; De Gruyter: Vienna, Austria, 2016; pp. 35–83.
- Hoeltgen, L.; Peter, P.; Breuß, M. Clustering-based quantisation for PDE-based image compression. Signal Image Video Process. 2018, 12, 411–419.
- Zhou, Q.; Yao, H.; Cao, F.; Hu, Y.C. Efficient image compression based on side match vector quantization and digital inpainting. J. Real-Time Image Process. 2019, 16, 799–810.
- Satpute, Y.Y.; Tidke, B. Data Compression and Hiding Using Advanced SMVQ and Image Inpainting. In Proceedings of the 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 12–14 December 2015; pp. 1074–1077.
- Peter, P.; Contelly, J.; Weickert, J. Compressing audio signals with inpainting-based sparsification. In Scale Space and Variational Methods in Computer Vision, Proceedings of the 7th International Conference, SSVM 2019, Hofgeismar, Germany, 30 June–4 July 2019; Proceedings 7; Springer: Cham, Switzerland, 2019; pp. 92–103.
- Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306.
- Li, L.; Fang, Y.; Liu, L.; Peng, H.; Kurths, J.; Yang, Y. Overview of compressed sensing: Sensing model, reconstruction algorithm, and its applications. Appl. Sci. 2020, 10, 5909.
- Stolojescu-Crişan, C.; Isar, A. Images compressive sensing reconstruction by inpainting. In Proceedings of the 2015 International Symposium on Signals, Circuits and Systems (ISSCS), Iasi, Romania, 9–10 July 2015; pp. 1–4.
- Gan, L. Block compressed sensing of natural images. In Proceedings of the 2007 15th International Conference on Digital Signal Processing, Cardiff, UK, 1–4 July 2007; pp. 403–406.
- Mishra, M.; Sen Gupta, G.; Gui, X. Investigation of energy cost of data compression algorithms in WSN for IoT applications. Sensors 2022, 22, 7685.
- Liu, L.; Nishikawa, H.; Zhou, J.; Taniguchi, I.; Onoye, T. Computer-Vision-Oriented Adaptive Sampling in Compressive Sensing. Sensors 2024, 24, 4348.
- Jiang, F.; Tao, W.; Liu, S.; Ren, J.; Guo, X.; Zhao, D. An end-to-end compression framework based on convolutional neural networks. IEEE Trans. Circuits Syst. Video Technol. 2017, 28, 3007–3018.
- Mondal, U.K.; Debnath, A.; Mandal, J. Deep learning-based lossless audio encoder (dllae). In Intelligent Computing: Image Processing Based Applications; Springer: Singapore, 2020; pp. 91–101.
- Schiopu, I.; Munteanu, A. Deep-learning-based lossless image coding. IEEE Trans. Circuits Syst. Video Technol. 2019, 30, 1829–1842.
- Huang, H.; Schiopu, I.; Munteanu, A. Deep learning based angular intra-prediction for lossless HEVC video coding. In Proceedings of the 2019 Data Compression Conference (DCC), Snowbird, UT, USA, 26–29 March 2019; p. 579.
- Toderici, G.; Vincent, D.; Johnston, N.; Jin Hwang, S.; Minnen, D.; Shor, J.; Covell, M. Full resolution image compression with recurrent neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 5306–5314.
- Agustsson, E.; Tschannen, M.; Mentzer, F.; Timofte, R.; Gool, L.V. Generative adversarial networks for extreme learned image compression. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 221–231.
- Huang, Q.; Liu, T.; Wu, X.; Qu, T. A generative adversarial net-based bandwidth extension method for audio compression. J. Audio Eng. Soc. 2019, 67, 986–993.
- Santurkar, S.; Budden, D.; Shavit, N. Generative compression. In Proceedings of the 2018 Picture Coding Symposium (PCS), San Francisco, CA, USA, 24–27 June 2018; pp. 258–262.
- Liu, T.; Wang, J.; Liu, Q.; Alibhai, S.; Lu, T.; He, X. High-ratio lossy compression: Exploring the autoencoder to compress scientific data. IEEE Trans. Big Data 2021, 9, 22–36.
- Bolin, C.; Shanzhi, Y.; Peilin, C.; Shiqi, W.; Ye, Y. Generative Visual Compression: A Review. In Proceedings of the 2024 IEEE International Conference on Image Processing (ICIP 2024), Abu Dhabi, United Arab Emirates, 27–30 October 2024.
- Lin, J.; Liu, D.; Yang, H.; Li, H.; Wu, F. Convolutional neural network-based block up-sampling for HEVC. IEEE Trans. Circuits Syst. Video Technol. 2018, 29, 3701–3715.
- Zhang, Y.; Shen, T.; Ji, X.; Zhang, Y.; Xiong, R.; Dai, Q. Residual highway convolutional neural networks for in-loop filtering in HEVC. IEEE Trans. Image Process. 2018, 27, 3827–3841.
- Yang, E.H.; Kieffer, J.C. Simple universal lossy data compression schemes derived from the Lempel-Ziv algorithm. IEEE Trans. Inf. Theory 1996, 42, 239–245.
- Luczak, T.; Szpankowski, W. A suboptimal lossy data compression based on approximate pattern matching. IEEE Trans. Inf. Theory 1997, 43, 1439–1451.
- Atallah, M.; Génin, Y.; Szpankowski, W. Pattern matching image compression: Algorithmic and empirical results. IEEE Trans. Pattern Anal. Mach. Intell. 1999, 21, 614–627.
- Di, S.; Cappello, F. Fast error-bounded lossy HPC data compression with SZ. In Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, IL, USA, 23–27 May 2016; pp. 730–739.
- Lindstrom, P. Fixed-rate compressed floating-point arrays. IEEE Trans. Vis. Comput. Graph. 2014, 20, 2674–2683.
- Ainsworth, M.; Tugluk, O.; Whitney, B.; Klasky, S. Multilevel techniques for compression and reduction of scientific data—The univariate case. Comput. Vis. Sci. 2018, 19, 65–76.
- Cappello, F.; Di, S.; Li, S.; Liang, X.; Gok, A.M.; Tao, D.; Yoon, C.H.; Wu, X.C.; Alexeev, Y.; Chong, F.T. Use cases of lossy compression for floating-point data in scientific data sets. Int. J. High Perform. Comput. Appl. 2019, 33, 1201–1220.
- Podgorelec, D.; Žalik, B.; Mongus, D.; Vlahek, D. A New Alternating Suboptimal Dynamic Programming Algorithm with Applications for Feature Selection. Mathematics 2024, 12, 1987.
- Žalik, B.; Strnad, D.; Kohek, Š.; Kolingerová, I.; Nerat, A.; Lukač, N.; Lipuš, B.; Žalik, M.; Podgorelec, D. FLoCIC: A few lines of code for raster image compression. Entropy 2023, 25, 533.
- Nerat, A.; Strnad, D.; Žalik, K.R.; Žalik, B. An efficient multi-resolution chain coding. IEEE Access 2024, 12, 54721–54731.
- Strnad, D.; Žlaus, D.; Nerat, A.; Žalik, B. Efficient compressed storage and fast reconstruction of large binary images using chain codes. Multimed. Tools Appl. 2024, 1–19.
- Hácha, F.; Dvořák, J.; Káčereková, Z.; Váša, L. Editing mesh sequences with varying connectivity. Comput. Graph. 2024, 121, 103943.
- Červenka, M.; Skala, V. Behavioral study of various radial basis functions for approximation and interpolation purposes. In Proceedings of the 2020 IEEE 18th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herlany, Slovakia, 23–25 January 2020; pp. 135–140.
Stream | Alphabet Size | Alphabet | # Of Symbols | Entropy | Min. Bitlength | Restoration Success |
---|---|---|---|---|---|---|
UNITED KINGDOM | 10 | D,E,G,I,K,M,N,O,T,U | 13 | 3.239 | 42.106 | Yes |
UMIDID KIMDOM | 6 | D,I,K,M,O,U | 12 | 2.396 | 28.755 | Yes |
UMIDID KIMDUM | 5 | D,I,K,M,U | 12 | 2.230 | 26.755 | No |
OMIDID KIMDOM | 5 | D,I,K,M,O | 12 | 2.230 | 26.755 | No |
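The Entropy and Min. Bitlength columns of the table above can be reproduced with the Shannon entropy of the symbol frequencies. The following Python sketch is an illustration, with hypothetical function names; it assumes, as the symbol counts in the table suggest, that the space is not counted as a symbol and that the minimum bit length is the entropy multiplied by the number of symbols.

```python
from collections import Counter
from math import log2

def entropy_bits_per_symbol(symbols):
    """Shannon entropy H = -sum(p * log2(p)) over the symbol frequencies."""
    counts = Counter(symbols)
    total = len(symbols)
    return -sum(c / total * log2(c / total) for c in counts.values())

def summarize(stream):
    """Return (alphabet size, number of symbols, entropy, minimum bit length)."""
    symbols = stream.replace(" ", "")   # assumption: letters only, no space
    h = entropy_bits_per_symbol(symbols)
    return len(set(symbols)), len(symbols), h, h * len(symbols)

for stream in ("UNITED KINGDOM", "UMIDID KIMDOM", "UMIDID KIMDUM", "OMIDID KIMDOM"):
    size, count, h, bits = summarize(stream)
    print(f"{stream:15s} {size:3d} {count:3d} {h:.3f} {bits:.3f}")
```

Under these assumptions, the computed values agree with the table to three decimal places, for example 3.239 bits per symbol and 42.106 bits for UNITED KINGDOM.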
# | Genre | Features | OGG kHz | OGG Quality | Orig. APE CR | OGG + APE Res. CR | Comparison CR |
---|---|---|---|---|---|---|---|
1 | Misc. | Linear, average, mask, verbatim, RLE | / | / | 1.77 on average | / | 0.08–0.16 |
2 | Rock | Black Box | Misc. | Misc. | 1.48 | 1.34–1.47 | 0.01–0.14 |
3 | Rock | Black Box | 44.1 | 10 | 1.41 | 1.34 | 0.07 |
4 | Rock | Black Box | 1.378 | 0 | 1.41 | 1.39 | 0.02 |
5 | Classic | Black Box | 2 | 1 | 3.48 | 3.49 | −0.01 |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Podgorelec, D.; Strnad, D.; Kolingerová, I.; Žalik, B. State-of-the-Art Trends in Data Compression: COMPROMISE Case Study. Entropy 2024, 26, 1032. https://doi.org/10.3390/e26121032