Review on Application of Acoustic MPEG Audio Player: Analytical Essay on Sound Playing

Essay type:
This essay sample was donated by a student to help the academic community. Papers provided by EduBirdie writers usually outdo students' samples.

Cite this essay cite-image

Abstract -

The utilization of advanced versatile music gadgets, for example, MP3 players has quickly expanded amid the most recent decade, and the users usually did not know how the MP3 player works. This article investigates MP3 player the signal processing, it is involve the conversion from analog to digital signal. Also that, in this paper will introduced about classification of signal, properties of signal and system block diagram. The material introduced in this article depends on a subjective meeting study concentrated on MP3 player use as a medium for melodic self-mind. Since MP3 clients can tune in to whatever they need, at whatever point they need, and focus on their music in light of a legitimate concern for overseeing and managing dispositions and feelings, the MP3 player speaks to a profitable and helpful innovation of effect direction.

Index Terms—MP3 Player; Signal Processing; Analog to Digital;

Save your time!
We can take care of your essay
  • Proper editing and formatting
  • Free revision, title page, and bibliography
  • Flexible prices and money-back guarantee
Place an order


Not at all like prior types of music players that required moving parts to peruse encoded information on a tape or CD, MP3 players used solid state memory. A MP3 player is close to an information stockpiling gadget with an installed programming application that enables clients to exchange MP3 records to the player. MP3 players additionally incorporate utilities for duplicating music from the radio, CDs, radio or Web destinations and the capacity to arrange and make custom arrangements of melodies in the request you need to hear them. This rundown of melodies is known as a playlist.

Digital Audio Player or also knows as MP3-Player is an electronic device that can play digital audio files. Audio coding starts with the conversion of analog signal such as speech and music to digital form. The digital signal processed by a digital signal processor according to the requirement to obtain a encoded signal. The digital signal is encoded and then decoded back to get the original analog signal.

This paper propose is review on system properties of signal for application in MP3 player. The basic knowledge characteristic of signal and system is already applied in lecturer.

Digital Signal Processors (DSP) take real-world signals like voice, audio, video, temperature, pressure, or position that have been digitized and then mathematically manipulate them. A DSP is designed for performing mathematical functions like 'add', 'subtract', 'multiply' and 'divide' very quickly.

Signal should be processed so the data contain information can be shown. In the real-world, analog products detect signals such as sound, light, temperature or pressure and manipulate them. Converters such as an Analog-to-Digital converter then take the real-world signal and turn it into the digital format of 1’s and 0’s. From here, the DSP takes over by capturing the digitized information and processing it. It then feeds the digitized information back for use in the real world. It does this in one of two ways, either digitally or in an analog format by going through a Digital-to-Analog converter. All of this occurs at very high speeds.

Figure 1: Flowchart of sEMG signal processing [1]

Four sorts of control command for MP3 player are acknowledged: play, stop, forward and in reverse. Initially, the three channels of sEMG information are recorded all the while. Furthermore, the crude sEMG information are go through a band pass filter of 25-100Hz and a step filter of 50 Hz keeping in mind the end goal to wipe out the effects of curios. Thirdly, the parameter of incorporated EMG is figured for the preprocessed sEMG information,

Where N is the length of information for ascertaining the incorporated EMG and x is the EMG information inside N-length information portion. Besides, the parameter esteems are contrasted and edges and transmitted into double estimations of 0 and 1. At long last, the four control charges for MP3 player are gotten by the blend of double estimations of three channels. [1]

Figure 2: Block Diagram for MP3 Model [4]

Classification of signals in MP3

Figure 3: Signal Waveform [3]

A. Continuous time

As we know that continuous-time (CT) signals are functions whose amplitude or value varies continuously with time, X(t). The symbols t denotes time for continuous-time signals and ( ) use to denote continuous-time value quantities. MP3 is one of the continuous-time class type signals.

B. Odd

As we already know the definition of odd signals is the signals that is a non-symmetrical signals. From the given signals in (a), (b), and (c) it shows that the signals in MP3 is a non-symmetrical type class of signal. Formula for odd signals:

In DT (discrete time)

In CT.

C. Periodic

The following graph (a), (b) and (c) above, it is shown that MP3 signals is also a periodic signals. We can see from signals in (a) that MP3 signals is a periodic signals where X(t) is a function of time that varies the condition:

for all t,

Where T is a positive constant.

D. Random

From diagram (a) MP3 flags likewise demonstrates that it is a random class. The signs in (a) demonstrates that there is vulnerability before it happen. It is likewise might be seen as having a place with a troupe or a gathering of signs in the outfit that having an alternate waveform. It wave or signals formed cannot be determine because it have uncertainty. The signals in diagram 1 shows that each complete cycles have difference uncertainty which shows how random is it. The amplitude of signals in MP3 is also shows that it is fluctuated between positive and negative in randomly style.[image: ]

E. Power

The equation below represents that mp3 signals classification types is power.

the equation for this signals varies the average power of signals satisfy the condition:

The investigation filter bank for MP3 has a place with a class of cross breed filter banks and is involved a polyphase filter bank taken after by an adjusted discrete cosine change. The filter bank disintegrates the info motion into subsampled ghastly segments, which are utilized with the perceptual model to pack the flag, utilizing the marvels of sound-related concealing. Sound-related concealing is a consequence of the sound-related frameworks being not able separate segments of a complex sound. The perceptual model gives appraisals of veiling limits, which are utilized by the quantization and coding square to expel flag segments that are perceptually inconsequential. An endeavor is made to keep the clamor underneath the veiling limit. A nonlinear power-law quantize is additionally utilized, alongside Huffman coding. The result is then gathered into the MP3 bit stream. (Article 9) [2]

Basic operations of signal in MP3

Basic operation was basically about the process in for the function inside the mp3. It’s depending on their time scaling, time reflection and time shifting. Based on three of basic equation it show the movement of signal in mp3 for their input and output process.

Time scaling

In audio coding application, the sub- band signal in processing is very useful. Through analysis filter bank, the sub encoder decompose an input signal x(n) into M different frequency range called sub-band. The sub-band signals (n) are then down sampled to sub- band- signal (m) =(mM) and quantized. The decoder combining the quantized sub-band signals yi ( m) into an output signal i( n) by up sampling them to x;(n) (with x;(mM) = y;(m) and x;(mM +l) = 0 for l = 0, ... , M -l) and going them through the synthesis filter bank them through the synthesis filter bank. The number of sub striation and the decimation -interpolation ratio are not independent; in a critically sampled filter bank, , for which the number of ratio are not independent; in a critically sampled filter bank, for which the number of sub-band signal samples is compeer to the number of samples, the number of sub-bands and the decimation-interpolation ratio are identical.[4]

Figure 2.1 A sub-band coder

The rationale for sub-band signal processing is the need for time- and relative frequency- dependent SNR command to allow transparent encoding of audio frequency and picture signal with very low bit rates. As a matter of fact, when applying uniform quantization to the input signal using a given routine of bits per sample, the resulting quantization noise is uniformly distributed in the frequency range of the signal. Since the power spectral denseness(PSD) of the input signal changes with time and frequency, uniform quantization results in time- and frequency-dependent signal-to noise ratio (local SNR). The number of bits per sample (hence the overall SNR) is therefore usually readiness to a very high value (the 16 bits-96 –element 105 of the compact disk) in society for the worst-case local SNRs to be statistically acceptable to the human ear. In contrast, in sub- band signal processing, the numeral of routine allocated to each sub-band signal can be adapted to the specific SNR required in the corresponding frequency band. This, as we shall see, allows for drastic bit rate step-down. [4]

In filter bank analysis, PCM signals are converted into 32 sub-band signals through a poly phase filter Hi[n]. [3]

Where h[n] is a low-pass filter computed by using a set of coefficients, for Mp3stego. [3]

The original PCM signal and the filtered signals are denoted by x[n] and Pi[n], respectively. [8]

We obtain sub-band signals Si[n], which are Pi[n] down-sampled by 32. [8]


An important feature of the analysis and deduction filter in a filter bank is that they should allow perfect Reconstruction Period (Pr) (or, in practice, nearly perfective tense reconstruction) when no quantization is performed, ie;

In order to examine this problem in a simple case, let us focus on a two channel filter bank (Fig. 2.2) composed of a low-notch (upper) and a high pass (lower) subdivision. [4]

Figure 2.2 A two-channel filter bank

The PR consideration implies to cancel (or at least minimize) three types of distortion, respectively due to the phase and amplitude reception of the filter and to the aliasing that is inevitably introduced by the imperfection of the decimation-interpolation steps. One of the great ideas of sub-band coding, precisely, is that the PR condition does not imply that aliasing should be minimized in each branch separately (which would require close-to-ideal filters) but that some aliasing distortion can be accepted for sub-band signals, provided all sub-band distortions in adjacent sub-band are cancel by the final examination summation. As a issue , analysis and synthetic thinking filters are not chosen independently. [4]

The combined decimation -insertion gradation, which transform each sub-band signal x;( n) into .X;( n ), replace every other sample in x;( n) (i=O,J) by zero. It is thus equivalent to adding to each sub-band signal X; (n) a transcript of it in which every other sample is sign-reversed:

The added signal X;- ( n) can be seen as a modulation of X; ( n) by exp (jn), i.e., by a Nyquist which implies that is thus the quadrature mirror of 1 X/ f) 1, i.e., its mirror with respectfulness to f = 14

Frequency aliasing occurs in the top offset of Figure 2.2 when these DTFTs overlap, as can be seen in equation [4] . A similar conclusion could be drawn for the lower branch.

The output of the filter bank can be written as follows:

In which the second term is due to the quadrature mirror picture of signals x0(n) and x1(n) and is therefore responsible for the possible aliasing.

Figure 3.3 Decimation -interpolation seen as the addition of x0 (n) to the Low sub-band signal. Due to the intersection of their DTFfs in the black triangle, aliasing occurs, which prevents I X0(f) I from being equal to I X 0 (f) I for quintuplet [0, 1/4]. [4]

Equation [3] can be further expanded as follows:

which can be identified with [1] if

The second shape is easy to satisfy with

The first condition in [8] then becomes

C. Reflection

One way of satisfying this restraint was introduced by Esteban and Galland ( 1977), who further imposed ~ ( n) to be the quadrature mirror filter (QMF) of lzo(n):

and found some prototype filter H0 (z) such that

The final filter bank is given in Figure 2.2 The beauty of this idea. Lie in its simplicity.

In recitation ,(Hiz) is a linear phase form (i.e., hin ) is symmetrical) Fir tree filter of length N (N being even1), and in order to implement it as a causal filter its impulse reaction is delayed by (N-l)/2. This results in an overall delay of N-1 samples in both limb of the filter banking company . Pickings into account that

Its can easily find from [11],

It can be shown that designing HJ z) such that it satisfies (14) while being absolute frequency selective is impossible2• Johnston (1980) proposed an iterative overture , which effect in frequency-selective filter banks free of stage and aliasing over refinement , but with some (controllable) amplitude aberration . Many authors have proposed other PR filter with various properties, such as conjugate quadrature filters (CQF). This scientific give-and-take also led to the parallel development of the wavelet possibility.

Figure 3.4 A two-channel QMF filter bank implementing linear phase FIR filters of length N

Properties of MP3 system

MP3 is a shortening for MPEG-1 layer 3 encoding technology. It is broadly utilized for sound pressure, where it furnishes up to 12:1 pressure with almost no discernible debasement of value. It was initially created by the German College of Erlangen and the Frauenhofer Foundation. Afterward, the MPEG set a progression of standard containing distinctive methods for both sound and video pressure. The sound standards included three layers of various many-sided quality and execution. The third layer, layer 3 is equipped for compacting astounding music from 1.4Mbit/s down to 128 Kbit/s with no debasement.[4]

As indicated by the MPEG-1 standard, it is conceivable to transmit/store sound/video signals at a bit rate of 1-2 Mbit/s. There are three layers of pressure and many-sided quality: layer 1, layer 2, layer 3. They contrast in intricacy, quality and all the more vitally, the transfer speed possessed.[9] The MPEG-1 standard is divided into five 'sections': section 1 manages frameworks perspectives; section 2 with video; section 3 with sound pressure; section 4 with consistence testing; and section 5 with programming execution .But, the mp3 does not stand for the third part, it remains for the third layer of sound coding in the MPEG standard. In this way, mp3 is truly layer 3 of section 3 of the MPEG standard. [4]

MP3 is fundamentally perceptual codec, as it depends on the psychoacoustic properties of the human sound-related system (HAS) - for instance, masking. For each tone in a sound flag, there is a concealing edge. Another tone that lies underneath this limit will be conceal and consequently indiscernible. Unintelligible segments in a sound flag are subtle to the HAS and subsequently could be securely wiped out by the encoder. Layer 3 part of the MPEG-1 standard applies Huffman coding, cyclic redundancy coding (CRC), FFT, and an adjusted DCT (MDCT). It characterizes rates between 8 Kbit/s and 320 Kbit/s, however the default rate is generally 128 Kbit/s. a high piece rate implies that the examples are assessed precisely, giving a superior sound determination. Good for nothing rates are determined by the standard: steady piece rate (CBR) and the variable piece rate (VBR). Encoding utilizing CBR (the default) infers that the entire work is encoded utilizing a similar number of bits. Be that as it may, in light of the fact that the elements vary starting with one work then onto the next, sound streams are encoded utilizing an appropriate rate. MP3 documents are separated into outlines, of a span of 26ms at a rate of 38 outline/s. each casing stores 1152 sound examples.[4]

Fraunhofer's fundamental advancement was to utilize a numerical model of human sound-related observation to take into consideration more noteworthy information pressure in mp3 records. In essence, the record is intended to make sense of what you won't hear in any case and to dispose of the information for that bit of the sound. Despite the fact that it is an information record, it has been proposed effectively here that clients regard mp3s as social items. Mp3s resemble different advancements in another imperative way: they are collected by different innovations. The name for a program that amasses mp3s is an encoder. For the reasons for this New Media and Society 8(5)832 contention, an encoder will be dealt with like some other holder innovation that changes its contents. The encoder takes a current computerized recording and produce it through six related advances: [9]

  1. The mp3 encoder separates the flag into little pieces. Called 'outlines', each enduring a small amount of a moment.
  2. The encoder separates each edge into 72 discrete recurrence groups and investigates the sound flag to decide its 'otherworldly vitality appropriation'. It searches for parts of the recurrence range that have a ton of sound vitality in them and parts that have none. Essentially, the calculation tries to make sense of where the most imperative frequencies are in the sound.
  3. The encoder at that point chooses how much information to hold and how much to dispose of, contingent upon the measure of the mp3 that the client needs as a yield document. Relative size and rate of information pressure for mp3 documents are estimated in kilobytes every second, this is on the grounds that the extent of a mp3 is a measure of data transfer capacity for numerous applications – the general purpose of encoding is scaling down. In this way, a greater mp3 record has more kilobytes per second. To make a littler mp3, the encoder needs to discard more information from the first Album recording: it will dispose of something beyond information when it makes a 64kbps record than a 128kbps document.
  4. The encoder figures another tumbrel estimation for each. Outline in light of what it found out about the state of the approaching flag and on a numerical table of qualities that speaks to human psychoacoustic reaction. The fact is to dispose of information that individuals can't hear.
  5. The encoder at that point goes through Huffman coding, which is a standard information pressure calculation intended to dispose of most repetitive information in the record. Like a compress document, it doesn't get free of any information as such, yet spatially solidifies information stockpiling.
  6. At long last, the encoder gathers a 'serial bit stream' which contains header data and guidelines for each casing. These guidelines are for playback projects and gadgets to guarantee predictable playback.

The Binaural Prompt Coding methodology might be joined with a low bitrate sound coder to shape proficient framework for transmission and capacity of multi-channel sound giving two primary useful viewpoints: [10]

Firstly (and probably most importantly), it empowers a bitrate-productive portrayal of multi-channel sound signs. Contrasted with a transmission of C discrete sound channel signals, just a single sound flag must be sent to the decoder together with a conservative arrangement of spatial side data which brings about noteworthy bitrate reserve funds. For instance, the typical 5 channel (3/2) organize is lessened into a solitary entirety sound channel comparing to a general information diminishment of around 80% (i.e. 4 out of 5 channels are dropped, disregarding the conservative BCC side data).

Furthermore, the transmitted aggregate flag compares to a mono downmix of the multi-channel flag. For collectors that don't bolster multi-channel sound proliferation, tuning in to the transmitted entirety flag is in this way a substantial strategy for introducing the sound material on low-profile monophonic multiplication setups. On the other hand, BCC can along these lines additionally be utilized to improve existing administrations including the conveyance of monophonic sound material towards multi-channel sound.

The term DSP is frequently used to depict advanced flag preparing which portrays discrete time flag handling, as well as the extra advance of quantizing the yield of the perfect C/D converter to make a computerized flag which isn't just a discrete-time flag, yet one that is additionally fit for being represented by a settled number of bits, digital portrayal and preparing is regularly favored over direct simple/persistent time preparing for (at least three) principle reasons.

In the first place, discrete-time frameworks are more flexible complex handling, time shifting and versatile filtering, nonlinear preparing, multidimensional signs would all be able to be caught without breaking a sweat in such frameworks. Second, the ensured precision, as controlled by the enlist lengths (number of bits) used to play out the calculations, and not by the non-perfect incorporated circuit segments, including resistors capacitors, inductors, operational enhancers, and different segments whose execution isn't just not perfect, but rather will change after some time and with temperature. At long last computerized executions are regularly littler, less expensive and lower power consumption.

Figure 4: DSP System Block Diagram

Mp3s used psychoacoustic standards to dispose of the sounds that we as far as anyone knows would not hear in any case. There are three specific psychoacoustic traps that mp3 encoders use to decrease the size of information files: concurrent or sound-related covering, worldly concealing and spatialization. Sound-related covering is the disposal of comparative frequencies, in view of the rule that when two hints of comparative recurrence are played together and one is significantly calmer, individuals will hear just the louder sound. Worldly veiling is a comparable rule crosswise over time: if there are two sounds near one another in time (not exactly around five milliseconds separated, contingent upon the material) and one is significantly louder than alternate, audience members can just hear the louder sound. The third rule is spatialization. While it is anything but difficult to find the course of sounds amidst the discernable range when they are played back in stereo, it is near outlandish for individuals to find low or high sounds. To spare more data space, the mp3 encoder spares sounds at either end of the recurrence range once for the two channels, instead of twice and plays them back as mono files. Since most human grown-ups can't hear over 16khz, some mp3 encoders additionally toss out every one of the information from 16– 20khz to spare significantly more space. Psychoacoustically, the mp3 is intended to discard sonic material that audience members as far as anyone knows would not hear something else. [9]

Figure 5: Psychoacoustic Model


As a conclusion, this paper focused about system properties in application MP3 player. The explanation about signal and system already discussed in this paper. However, the basic of operation of MP3 player work also explain how the conversion of signal such as Analog-to-Digital (ADC), Digital Signal Processing (DSP) and Digital-to-Analog (DAC).


  1. Huang, M., Wang, S., Yang, H., Xiao, S., & Wang, B. (2013). A semg-based MP3 player control system. 2013 ICME International Conference on Complex Medical Engineering. doi:10.1109/iccme.2013.6548341
  2. Scala, L. (n.d.). Time-domain simulation for transient stability analysis. Advances in Power System Modelling, Control and Stability Analysis, 311-338. doi:10.1049/pbpo086e_ch9
  3. Qiao, M., Sung, A. H., & Liu, Q. (2013). MP3 audio steganalysis. Information Sciences, 231, 123-134. doi:10.1016/j.ins.2012.10.013
  4. Uhl, T., Paulsen, S., & Nowicki, K. (2017). New approach for determining the QoS of MP3-coded voice signals in IP networks. EURASIP Journal on Audio, Speech, and Music Processing, 2017(1). doi:10.1186/s13636-016-0099-4
  5. Giovanardi, A., Mazzini, G., & Tomassetti, M. (n.d.). Chaos based audio watermarking with MPEG psychoacoustic model I. Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint. doi:10.1109/icics.2003.1292739
  6. Kuech, F., Mitnacht, A., & Kellermann, W. (n.d.). Nonlinear Acoustic Echo Cancellation Using Adaptive Orthogonalized Power Filters. Proceedings. (ICASSP 05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.doi:10.1109/icassp.2005.1415657
  7. Moon, H. (2012). A Low-Complexity Design for an MP3 Multi-Channel Audio Decoding System. IEEE Transactions on Audio, Speech, and Language Processing, 20(1), 314-321. doi:10.1109/tasl.2011.2161081
  8. Bazyar, M., & Sudirman, R. (2014). A Recent Review of MP3 Based Steganography Methods. International Journal of Security and Its Applications, 8(6), 405-414. doi:10.14257/ijsia.2014.8.6.35
  9. Sterne, J. (2006). The mp3 as cultural artifact. New Media & Society, 8(5), 825-842. doi:10.1177/1461444806067737
  10. Brixen, E. B. (2011). Multi-Channel and Surround Sound. Audio Metering,135-150. doi:10.1016/b978-0-240-81467-4.10020-6
Make sure you submit a unique essay

Our writers will provide you with an essay sample written from scratch: any topic, any deadline, any instructions.

Cite this paper

Review on Application of Acoustic MPEG Audio Player: Analytical Essay on Sound Playing. (2022, September 27). Edubirdie. Retrieved June 23, 2024, from
“Review on Application of Acoustic MPEG Audio Player: Analytical Essay on Sound Playing.” Edubirdie, 27 Sept. 2022,
Review on Application of Acoustic MPEG Audio Player: Analytical Essay on Sound Playing. [online]. Available at: <> [Accessed 23 Jun. 2024].
Review on Application of Acoustic MPEG Audio Player: Analytical Essay on Sound Playing [Internet]. Edubirdie. 2022 Sept 27 [cited 2024 Jun 23]. Available from:

Join our 150k of happy users

  • Get original paper written according to your instructions
  • Save time for what matters most
Place an order

Fair Use Policy

EduBirdie considers academic integrity to be the essential part of the learning process and does not support any violation of the academic standards. Should you have any questions regarding our Fair Use Policy or become aware of any violations, please do not hesitate to contact us via

Check it out!
search Stuck on your essay?

We are here 24/7 to write your paper in as fast as 3 hours.