Zoom out Search Issue

ManualsBrandsContents Manualsaudio & home theatreZoom in

[

VOLUME 32 NUMBER 2 MARCH 2015

]

Contents | Zoom in | Zoom out Search Issue | Next PageFor navigation instructions please click here

Summary of content (231 pages)

PAGE 1
Contents | Zoom in | Zoom out For navigation instructions please click here Search Issue | Next Page [VOLUME 32 NUMBER 2 MARCH 2015] Contents | Zoom in | Zoom out For navigation instructions please click here Search Issue | Next Page
PAGE 2
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Ultra Small 2x2mm 2W ATTENUATORS Save PC board space with our new tiny 2W fixed value absorptive attenuators, available in molded plastic or high-rel hermetic nitrogen-ﬁlled ceramic packages. They are perfect building blocks, reducing effects of mismatches, harmonics, and intermodulation, improving isolation, and meeting other circuit level requirements.
PAGE 3
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [CONTENTS] [VOLUME 32 NUMBER 2] [SPECIAL SECTION—SIGNAL PROCESSING TECHNIQUES FOR ASSISTED LISTENING] 16 FROM THE GUEST EDITORS Sven Nordholm, Walter Kellermann, Simon Doclo, Vesa Välimäki, Shoji Makino, and John R.
PAGE 4
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [IEEE SIGNAL PROCESSING magazine] Min Wu—Editor-in-Chief University of Maryland, College Park United States AREA EDITORS Feature Articles Shuguang Robert Cui—Texas A&M University, United States Special Issues Wade Trappe—Rutgers University, United States Columns and Forum Gwenaël Doërr—Technicolor Inc.
PAGE 5
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Now... 2 Ways to Access the IEEE Member Digital Library With two great options designed to meet the needs—and budget—of every member, the IEEE Member Digital Library provides full-text access to any IEEE journal article or conference paper in the IEEE Xplore® digital library.
PAGE 6
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [from the EDITOR] Min Wu Editor-in-Chief minwu@umd.edu ____________ Sharing Signal Processing with the World I am writing this editorial for the March issue of IEEE Signal Processing Magazine (SPM) as 2014 comes to a close.
PAGE 7
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® LOW NOISE BYPASS AMPLIFIERS 89 from ea.(qty.1000) 1 500 MHz-5 GHz Very rarely does a new product achieve many breakthrough features in one model. Mini-Circuits’ TSS-53LNB+ is this rare exception.
PAGE 8
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [president’s MESSAGE] Alex Acero 2014–2015 SPS President a.acero@ieee.org ____________ The IEEE Signal Processing Cup: A Competition for Undergraduate Students S ignal processing is becoming part of the curriculum in many undergraduate engineering programs.
PAGE 9
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [reader’s CHOICE] Top Downloads in IEEE Xplore T he “Reader’s Choice” column in IEEE Signal Processing Magazine contains a list of articles published by the IEEE Signal Processing Society (SPS) that ranked among the top 100 most downloaded IEEE Xplore articles. This issue is based on download data through June 2014.
PAGE 10
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [reader’s CHOICE] continued MAR 2014 FEB 2014 JAN 2014 N TIMES IN TOP 100 (SINCE JAN 2011) 55 92 27 12 RANK IN IEEE TOP 100 TITLE, AUTHOR, PUBLICATION YEAR IEEE SPS PUBLICATIONS ABSTRACT JUN 2014 MAY 2014 81 APR 2014 IMAGE SUPER-RESOLUTION VIA SPARSE REPRESENTATION Yang, J.; Wright, J; Huang, T.S.; Ma, Y. IEEE Transactions on Image Processing vol. 19, no.
PAGE 11
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [society NEWS] SPS Fellows and Award Winners Recognized I n this column of IEEE Signal Processing Magazine, 51 IEEE Signal Processing Society (SPS) members are recognized as Fellows, and award recipients are announced. 51 SPS MEMBERS ELEVATED TO FELLOW Each year, the IEEE Board of Directors confers the grade of Fellow on up to onetenth of 1% of the Members.
PAGE 12
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [society NEWS] continued perceptual image and video compression and quality. Paris Smaragdis, Urbana, Illinois, United States: For contributions to audio source separation and audio processing. Hing Cheung So, Kowloon, China: For contributions to spectral analysis and source localization.
PAGE 13
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE SPS transactions or IEEE Journal of Selected Topics in Signal Processing, in an issue predating the Spring Awards Board meeting by at least ten years (typically held in conjunction with ICASSP). The recipients of the first Sustained Impact Paper Award are: ■ Stephane G.
PAGE 14
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [special REPORTS] John Edwards Signal Processing Drives a Medical Sensor Revolution S ensor technology’s impact on health care is growing rapidly. New applications are appearing almost daily.
PAGE 15
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Signal processing is also essential for creating receivers that can cope with an onslaught of data coming in from large numbers of ultrasonic sensors floating inside a body. “Basically, solving some mathematical optimization problems gives us the best way to share the channel between different devices that are trying to transmit at the same time,” Melodia explains.
PAGE 16
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [special REPORTS] continued [FIG2] A multielectrode nerve cuff used for velocity-selective recordings made by Martin Schuettler, a senior scientist at the University of Freiburg and chief technology officer of CorTek, a Freiuburg, Germany-based developer of a neurotechnological platform for measuring and stimulating of brain activity. (Photo credit: Martin Schuettler.
PAGE 17
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® very small signals. “We have worked extensively to improve electrode and amplifier designs so as to stabilize the electrode characteristics and maximize the possible recorded SNR,” Taylor says. Taylor notes that the group’s signal processing algorithms are still incomplete.
PAGE 18
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [from the GUEST EDITORS] Sven Nordholm, Walter Kellermann, Simon Doclo, Vesa Välimäki, Shoji Makino, and John R. Hershey Signal Processing Techniques for Assisted Listening N atural hearing is a desirable goal in many electronic communication applications, such as hearing aids, audio conferencing, gaming, and virtual reality applications.
PAGE 19
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® reviews audio enhancement techniques for music listening in a noisy environment. A third article in this area, “Natural Sound Rendering for Headphones,” by Sunder et al., is an overview of techniques for rendering via headsets for applications in threedimensional audio.
PAGE 20
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Simon Doclo, Walter Kellermann, Shoji Makino, and Sven Nordholm] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 21
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Signal Acquisition Signal Enhancement Signal Presentation Out Right Right Monaural or Binaural Output w(k, l ) x(k, l ) Out Left Left Localization/Binaural Cue Determination Source Position and Binaural Cues [FIG1] The main processing blocks in an ALD. listening devices (e.g.
PAGE 22
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® P-1 ... (often termed beamforming) and filtering in the time-frequency x m (k, ,) = / h p, m (k) s p (k, ,) + n m (k, ,), m = 1 f M, (2) p=0 domain, respectively. In addition to exploiting the statistics of the available observations, the optimum filter design should also use available prior knowledge, e.g.
PAGE 23
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® spacings ranging from 7 mm to 15 mm. Since the positions of the microphones do not coincide with the ear drum, and the acoustic path between the loudspeaker and the ear drum differs from the HRTF, the overall response of the device should be equalized to match the open-ear HRTF [3].
PAGE 24
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® matrix must be sufficiently reliable for every frequency bin. geometric inference of the source position. The latter class comSince subspace-based algorithms are separating the signal and prises cross-correlation-based [8] and cross-relation-based algonoise subspace, where the noise needs to be white or whitened, rithms, e.g., [9] and [10].
PAGE 25
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® variance (LCMV) beamformer [20], [21], where the power of the output signal is minimized subject to a single constraint assuring an undistorted response for the target source (or a filtered version of it). Different versions of the MVDR beamformer exist, either using the complete target ATF, the direct path of the ATF, or the relative transfer functions (RTFs).
PAGE 26
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® MVDR USING RTFs By constraining the desired component in the output signal to be equal to the speech component at an arbitrarily chosen reference microphone r [24], the constraint in (6) becomes ! y s0 = w H h 0 s 0 = h 0, r s 0, (11) which is equivalent to w H hu 0 = 1, where the RTF hu 0 is defined as h 0, 1 h 0, 2 h 0, M T 9 E.
PAGE 27
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® density matrix z s0 s0 h 0 h 0H can be estimated from the secondorder statistics of the microphone signals; cf. the section “Estimation of Interference and Noise Statistics.
PAGE 28
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Magnitude Response for Directional BSS 0 −80 −60 −40 −20 0 20 40 60 80 −5 −10 −15 −20 −25 −30 −35 0 1 2 3 4 5 Frequency (kHz) (a) 6 7 8 φ (°) φ (°) Magnitude Response for Delay and Subtract BF 0 −80 −60 −40 −20 0 20 40 60 80 −5 − −10 − −15 − −20 − −25 − −30 − −35 − 0 1 2 3 4 5 Frequency (kHz) (b) 6 7 8 [FIG6] Interference cancelation in a reverberant envir
PAGE 29
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® xR, 1(k, ) xL, 2(k, ) xR, 2(k, ) ... xL, 1(k, ) ... desired and the undesired components vary in an unpredictable way and call for instantaneous estimates. In the spectrotemporal domain, voice activity detection and speech presence probability estimation typically aim at identifying regions in the STFT domain where only undesired components are present, e.g., [6, ch.
PAGE 30
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 0 1, 00 0 2, 00 0 3, 00 0 4, 00 0 5, 00 0 6, 00 0 7, 00 0 8, 00 0 SNR Gain (dB) Signal Extraction” essentially generate a single-channel output signals y L and y R can be generated and presented to the left signal. Since in a binaural system two output signals (i.e., one for and the right ear (cf. Figure 7).
PAGE 31
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ACKNOWLEDGMENTS APPLICATION IN ALDs This research was supported in part by the Research Unit FOR In [30] and [44], the performance of the binaural MWF and some 1732 “Individualized Hearing Acoustics” and the Cluster of Excelof its extensions has been perceptually evaluated, both in terms of lence 1077 “Hearing4All,” funded by the German Research Founspeech intelligibility and l
PAGE 32
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® acoustic media. He founded Sensear in 2006 and is also its technology inventor. He is a member of the IEEE Signal Processing Society Technical Committee on Audio and Acoustic Signal Processing and is an associate editor of Journal of the Franklin Institute. [27] A. Spriet, M. Moonen, and J.
PAGE 33
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [ Konrad Kowalczyk, Oliver Thiergart, Maja Taseska, Giovanni Del Galdo, Ville Pulkki, and Emanuël A.P. Habets ] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 34
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® User Settings Direct Microphone Signals Spatial Analysis Storage Diffuse Direct Diffuse Transmission Parameters (Optional) Processing and Synthesis Output Signal(s) Parameters [FIG1] A high-level overview of the parametric spatial sound processing scheme.
PAGE 35
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® reproduction with arbitrary setups or for binaural reproduction [8]. Hence, instead of transmitting many microphone signals and carrying out the entire processing at the receiving side, only two signals (i.e., the direct and diffuse signals) need to be transmitted together with the parametric information.
PAGE 36
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® where the vector x (k, n) = [X (k, n, d 1), f, X (k, n, d M )] T contains the M microphone signals in the time-frequency domain, where d 1fM are the microphone positions. Without loss of generality, the first microphone located at d 1 is used as a reference microphone.
PAGE 37
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® SIGNAL EXTRACTION SINGLE-CHANNEL FILTERS A computationally efficient estimation of the direct and the diffuse components is possible using single-channel filters. Such processing is applied for instance in DirAC [1], where the direct and diffuse signals are estimated by applying a spectral gain to a single microphone signal.
PAGE 38
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Single-Channel Extraction Multichannel Extraction 0 0 −10 −10 −20 4 −30 −40 2 −50 −60 0 100 200 Time Frame Index (b) −20 4 −30 100 200 Time Frame Index (c) −40 2 −50 100 200 Time Frame Index (a) Diffuse Sound −60 0 0 6 Frequency [kHz] Frequency [kHz] 6 Direct Sound Input Signal Frequency [kHz] 6 −10 −20 4 −30 −40 2 −50 0 −60 100 200 Time Frame
PAGE 39
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® PARAMETER ESTIMATION For the computation of the filters described in the previous section, the required parameters need to be estimated. In singlechannel extraction, one parameter needs to be estimated, specifically the signal-to-diffuse ratio SDR (k, n) or the diffuseness W (k, n) .
PAGE 40
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q Frequency [kHz] Frequency [kHz] Frequency [kHz] Frequency [kHz] Frequency [kHz] Gain Functions THE WORLD’S NEWSSTAND® Pleft(θ ) B(θ ) Pright(θ ) 1 0.
PAGE 41
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® depicted in Figure 6(a). To reproduce the diffuse sound, the signals Yd, i (k, n) are decorrelated such that Yd, i (k, n) and Yd, j (k, n) for i ! j are uncorrelated [29]. Note that the less correlation between the loudspeaker channels, the more enveloping the perceived sound is. The described processing for synthesizing the loudspeaker signals is depicted in Figure 5(b).
PAGE 42
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® microphone position d 1 . As in [27], the real factors are typically applied which compensate for the amplitude change following the 1/r law, where r is the propagated distance.
PAGE 43
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® estimates and degree of diffuseness. The evaluation of different setups with one desired talker and one interfering talker demonstrated that an improvement in the speech reception threshold (SRT) between 4 and 24 dB could be obtained.
PAGE 44
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Milano, Italy. In 2007, he received his doctoral degree from Technische Universität Ilmenau on the topic of channel modeling for mobile communications. He then joined Fraunhofer Institute for Integrated Circuits IIS working on audio watermarking and parametric representations of spatial sound.
PAGE 45
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Bastiaan Kleijn, João B. Crespo, Richard C. Hendriks, [W.Petko N. Petkov, Bastian Sauert, and Peter Vary ] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 46
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® To define quantitative instrumental measures of intelligibility, we consideration of the rendering environment. It is also a major facmust select a level of abstraction.
PAGE 47
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® is nonnegative and cannot be larger than the entropy H (M˘ T). Thus, the difference D (M˘ L, M˘ T) = H (M˘ T) - I (M˘ L; M˘ T) is nonnegative and can be interpreted as a distortion.
PAGE 48
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 10 0 −10 −20 −30 −40 −50 −60 −70 −80 Constant 1 Response Level (dB) acoustic features are based on the Probability-Based Enhancement,” THE LACK OF FEEDBACK, short-time discrete Fourier transthe standard approach to ASR comTOGETHER WITH THE RECENT form (DFT) coefficients, variance putes the probability of the observaABILITY TO COMMUNICATE estimation can be based on the tions
PAGE 49
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® middle ear. This filter is cascaded with an auditory filter bank that models processing at the level of the basilar membrane in the cochlea. Subsequently, the envelope of each of the outputs of the auditory filters is obtained, which simulates the transduction of the inner hair cells. To model an absolute hearing threshold, a constant is added to each envelope.
PAGE 50
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® temporal (low level) modifications. SIGNAL PROCESSING THE CLASSIC SII HAS PROVEN In accordance with the Markov APPROACHES TO BE HIGHLY CORRELATED chain model of the communication In this section, the focus is on creWITH SPEECH INTELLIGIBILITY process, presented in the section ating practical enhancement sysIN MANY CONDITIONS AND HAS “Defining Intelligibility,” a hightems.
PAGE 51
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 2 va Low-level modifications do not require knowledge of the E i = 10 log 10 ( T,i ) - 10 log 10 (v 20), (10) fD, i intended message transcription. These can be subdivided into spectral, temporal, and spatial signal modifications as well as combinations thereof.
PAGE 52
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® we denote it for a band i as A (E i, D i). Let us define the piecewise linear sigmoid S (x; b 1, b 2) = ^max (min (x, b 2), b 1) - b 1 h / (b 2 - b 1), which has a range [0, 1] .
PAGE 53
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® straightforward optimization problems that can be solved using the Karush–Kuhn–Tucker conditions. The resulting analytic solutions are easy to implement. The later work of [8] models A (E i, D i) more accurately at low SNR values and provides improved performance over the original work of [4] under low SNR conditions. The discussion in this section assumed stationarity.
PAGE 54
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® effects can be included in the affine ENHANCEMENT OVER MULTIPLE IN ANNOUNCEMENT signal model given by [16] SPATIAL POINTS SCENARIOS IN PUBLIC We have considered preprocessing SPACES SUCH AS AIRPORTS, techniques that do not consider the a L = H E au T + v E , (17) TRAIN STATIONS, OR SHOPPING spatial aspects of the rendering sceMALLS, ENVIRONMENTAL NOISE nario.
PAGE 55
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® To demonstrate the use of the optimality conditions (19), let us consider the simple , 2 distortion measure given by d (a T, a L) = a L - a T 2 , (20) long history and were derived heuristically, are found to be consistent with communication theory. While the field of intelligibility enhancement has developed rapidly, opportunities for significant improvement remain.
PAGE 56
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® in 2003 and 2008, respectively. He was a Ph.D. researcher (2003–2007) and a postdoctoral researcher (2007–2010) at Delft University of Technology. In 2005, he was a visiting researcher at the Institute of Communication Acoustics, Ruhr-University Bochum, Germany, and in 2008–2009 he was a visiting researcher at Oticon A/S, Denmark.
PAGE 57
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Timo Gerkmann, Martin Krawczyk-Becker, and Jonathan Le Roux ] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 58
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® through an iSTFT operation, denoted by Kx = iSTFT (M X ) . For this, the inverse DFT of the STFT coefficients is computed and each segment is multiplied by a synthesis window w s (n - ,R); the windowed segments are then overlapped and added to obtain the modified time-domain signal.
PAGE 59
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® wrapped to its principle value, i.e., - r # z kX, , = +X k, , # r. To reveal these structures, alternative representations have been proposed, which consider phase relations between neighboring time-frequency points instead of absolute phases. Two examples of such representations are depicted in Figure 1(c) and (d).
PAGE 60
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® such as nonnegative matrix factorization, hidden Markov models, and discriminative approaches such as deep neural networks. However, mainstream approaches have tended to ignore the phase, mainly due to the difficulty of modeling it and the lack of clarity about its importance, as discussed next.
PAGE 61
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Indeed, one can easily show that d (x (i + 1), A)# I (z (i))# d (x (i), A) . Interestingly, if only parts of the phase are updated according to (3), the nondecreasing property still holds for I (z), but whether it still does for d (x, A) has not been established.
PAGE 62
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® contribution to the signal is obtained by the inverse DFT of the phase z (,i) combined with the target magnitude; frame , ’s contribution is then combined by overlap-add to the contribution of the previous frames, leading to a signal estimate for frame , ; the phase z (,i + 1) is estimated as the phase of this signal estimate to which the analysis window is applied.
PAGE 63
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® with real-valued amplitude A h and initial time-domain phase { h for harmonic component h. Due to the fixed relation between the frequencies, (5) is also referred to as the harmonic model, which is a special case of the more general sinusoidal model.
PAGE 64
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® explain the observed spectral coeffiharmonic are directly related to each THE PHASE OF TRANSIENT cients of the mixture. In [26] (and other through the phase response of SOUNDS IS NOT ONLY RELEVANT the references therein), Mowlaee and the analysis window z Wk ; see, e.g., [4] FOR DETECTION, BUT ALSO Saedi proposed to solve this ambigufor more details.
PAGE 65
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Enhanced Speech Before Phase Randomization Noisy Speech Enhanced Speech After Phase Randomization 1 1 1 0.5 0.5 0.5 0 0 0 –0.5 –0.5 –0.5 –1 0.2 0.4 0.6 Time (s) (a) 0.8 –1 0.2 0.4 0.6 Time (s) (b) 0.8 –1 0.2 0.4 0.6 Time (s) (c) 0.8 [FIG4] (a) Speech degraded by a click train.
PAGE 66
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® PESQ Improvement (MOS) PESQ Improvement (MOS) the magnitudes. With this approach, enhancement. As a classical Wiener WHEN AN INITIAL PHASE convergence is reached after only filter only changes the magnitudes in ESTIMATE IS ALSO EMPLOYED few iterations.
PAGE 67
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® magnitudes, in this article we ee.ic.ac.uk/hp/staff/dmb/voicebox/ ______________________ A PROMISING APPROACH reviewed methods that also involve voicebox.html). Because with [4] we ________ FOR PERFORMANCE IMPROVEMENT STFT phase modifications.
PAGE 68
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® STFT analysis and synthesis. Further research could therefore also address phase estimation using low latency filter banks.
PAGE 69
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Jan Wouters, Hugh J. McDermott, and Tom Francart] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.COM/NCANDRE EARPHONES—IMAGE LICENSED BY INGRAM PUBLISHING Sound Coding in Cochlear Implants [From electric pulses to hearing] C ochlear implantation is a life-changing intervention for people with a severe hearing impairment [1].
PAGE 70
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Microphone Preprocessing Stimulation Strategy RF Transmission Decoder Pulse Generation Electrode Array [FIG1] A block diagram of a complete CI system. language skills at a young age. In many countries, a single CI is reimbursed by health insurance organizations, and in some countries, the cost of a second CI is also reimbursed, primarily for children.
PAGE 71
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® impedances of the stimulation channels can be measured (which may lead to deactivation of some electrodes if faults are detected) and parameters of the preprocessing stage can be adjusted.
PAGE 72
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® F0mod eTone F0 Feature Extraction 13 Modulation Enhancement 12 Channel Selection 7 MP3000 Masking Model 8 Channel Selection 7 EE Onset Enhancement 11 Microphones 1 Channel Selection ACE Front-End Processing 2 Channel Selection Envelope Detection Filter Bank 3 7 CIS Mapping 5 4 FSP Temporal Feature Enhancement 9 (a) 7 Electrical Stimulation 6 HiRes120 Current
PAGE 73
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® the normal anatomical or neurophysiological place because generally electrode arrays do not allow insertion beyond the anatomical position corresponding to acoustic frequencies lower than 500–1,000 Hz. However, studies have shown that with time of use of the CI, cortical plasticity can partly compensate for this mismatch.
PAGE 74
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Synthesized Vowel ah MP3000 Waveform EE 1 0.5 0 –0.5 20 Relative Magnitude Relative Magnitude Relative Magnitude 20 15 10 5 –1 0 0.05 0.1 0.15 Time (s) (a) 0.2 0 0.05 0 1,600 800 400 0.2 15 10 15 10 0 0.05 0.1 0.15 Time (s) (e) 0.2 0 0.05 HiRes120 10 5 0.1 0.15 Time (s) (f) 0.2 FSP 12 15 Magnitude + Electrode Magnitude + Electrode 15 0.
PAGE 75
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® “A Boy Fell from the Window” MP3000 Waveform EE 2 0 –1 a boy –2 0 fell 0.5 15 10 5 from the window 1 Time (s) (a) 20 Relative Magnitude Relative Magnitude Relative Magnitude 20 1 1.5 0 0.5 0 1,600 800 400 1.5 15 10 15 10 0 0.5 1 Time (s) (e) 1.5 0 0.5 HiRes120 10 5 1 Time (s) (f) 1.
PAGE 76
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® “A Boy Fell from the Window”—Zoomed in on “Boy” MP3000 Waveform EE 2 20 0 –1 boy –2 0.3 0.4 Time (s) (a) 15 10 0.5 0.2 Spectrogram 0.5 0.2 1,600 800 400 15 10 0.5 15 10 5 0.2 CIS-22ch 0.3 0.4 Time (s) (e) 0.5 0.2 HiRes120 15 10 5 0.3 0.4 Time (s) (f) 0.5 FSP 12 15 Magnitude + Electrode Magnitude + Electrode 20 0.5 20 5 0.3 0.
PAGE 77
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Figures 4–6 show that ACE represents some speech formant peaks and formant trajectories (i.e., changes in formant frequency over time) more distinctly than CIS, particularly when background noise is present.
PAGE 78
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® “A Boy Fell from the Window” MP3000 Waveform EE 2 0 –1 0.5 1 Time (s) (a) Spectrogram 0.5 800 400 1 Time (s) (d) 1.5 0 0.5 1 Time (s) (c) F0mod 1.5 0 0.5 1 Time (s) (f) 1.5 20 15 10 15 10 5 5 0.5 10 1.5 Frequency (Hz) 1,600 200 0 0.5 CIS-22ch 1 Time (s) (e) 1.
PAGE 79
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® partially overlap, then temporal information from closely spaced electrodes will generally be combined at the neural level. Psychophysical studies have reported evidence that temporal patterns from nearby electrodes cannot be completely resolved by most CI recipients.
PAGE 80
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® running spectral analysis and the distribution of current levels across electrodes is determined such that the loudness experienced by the CI user is similar to that experienced by an average listener with NH. Preliminary perceptual studies with CI recipients using SpeL confirmed that the relation between loudness and the level and bandwidth of sounds was closer to normal [18].
PAGE 81
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® thresholds around 50–100 μs and, in the worst case, no ITD sensitivity at all. However, with commercial sound processors, subjects hardly use ITDs in ecologically relevant tasks such as sound source localization. This is at least partly due to poor coding of temporal cues by current commercial sound processing strategies.
PAGE 82
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® AUTHORS Jan Wouters (jan.wouters@med.kuleuven.be) __________________ obtained M.S. and Ph.D. degrees in physics from the University of Leuven, KU Leuven, Belgium, in 1982 and 1989, respectively, with an intermission for officer military service.
PAGE 83
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Terence Betlehem, Wen Zhang, Mark A. Poletti, and Thushara D. Abhayapala] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 84
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® g (y1) g (y2) Zone q ro Oq R Zone 1 Zone Q HQ g (yl) g (yL) (a) (b) [FIG1] (a) An illustration of personal sound zones in an office environment. (b) A loudspeaker array is used to create multiple sound zones for multiple listeners. concept of a personal sound zone, i.e., reproducing sound within a desired region of space with a reduced sound level elsewhere.
PAGE 85
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® loudspeaker weights are chosen to ensure the implementation is robust to driver positioning errors and changes in the acoustic environment. The ACC problem can then be posed as max Hb g g 2 subject to H d g 2 g 2 (2a) # D0 (2b) # E0 .
PAGE 86
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 1 3 0.6 2 0.4 y (m) 1 0.2 0 0 –0.2 –1 –0.4 –2 –0.6 –3 –0.8 –3 –2 –1 0 x (m) (a) 1 2 3 –1 Acoustic Contrast (dB) 0.
PAGE 87
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® the loudspeaker weights and solved using the least-absolute shrinkage and selection operator [16]. The assumption here is that the desired sound field can be reproduced by a few loudspeakers, which are placed close to the direction of the virtual source and are sparsely distributed in space.
PAGE 88
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® matrices are large and may have issues with computational requirements (for filtered x-RLS) and convergence rates (for filtered x-LMS). Poor convergence can be solved using eigenspace adaptive filtering [22] by performing a generalized singular value decomposition (SVD) to diagonalize the system. Unfortunately the SVD still incurs a high computational cost.
PAGE 89
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Magnitude (dB) −10 −20 −30 −40 −50 −60 0 0 −10 −10 −20 −20 −30 −40 −50 −60 −70 −70 −80 −80 100 200 300 Time (ms) (a) Magnitude (dB) Delivered Crosstalk Magnitude (dB) 0 −30 −40 −50 −60 −70 100 −80 200 300 Time (ms) (b) 100 200 300 Time (ms) (c) [FIG5] The shortening of impulse responses to 50 ms in a room of reverberation time 250 ms using (a) relaxed m
PAGE 90
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Standard loudspeakers typically have insufficient directivity to provide a significant enhancement of direct sound in a reverberant space. High directivity can be achieved using traditional array techniques such as delay and sum beamforming, but the array size must be large at low frequencies to achieve significant directivity.
PAGE 91
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 10 0 10 0 0 –20 –20 dB –10 dB –10 –30 –30 1 –50 –40 3 –60 101 1 2 2 –40 0 4 –50 3 4 5 5 102 103 Frequency (Hz) (a) 105 –60 101 102 103 Frequency (Hz) (b) 105 [FIG7] The normalized magnitude of the mode responses of (a) a spherical source and (b) a cylindrical source for orders 0–5. For example, a third-order speaker with radius a = 0.
PAGE 92
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® radiation from the rear of the driver, although the directivity results can be less accurate with frequency [7]. ARRAYS OF DIRECTIONAL SOURCES If multiple directional loudspeakers are available, then it becomes possible to create multiple zones of sound. Multizone reproduction requires a large number of monopole loudspeakers.
PAGE 93
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® include active room compensation, microphone array processing, and room acoustic modeling. Wen Zhang (wen.zhang@anu.edu.au) ______________ received the M.E. and Ph.D. degrees in electrical engineering from the Australian National University in 2005 and 2010, respectively.
PAGE 94
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Vesa Välimäki, Andreas Franck, Jussi Rämö, Hannes Gamper, and Lauri Savioja] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 95
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Attenuation (dB) Figure 2 shows measured isolation curves of different types of headphones, where the black solid line is the isolation +Head Tracking +Mic Augmented Virtual +Binaural Synthesis +Mixing curve of an open-back circum-aural (CA) Reality Reality hi-fi headphone, the green dashed-dotted Audio Audio line is the isolation of a closed-back supra-aural (SA) headphone, t
PAGE 96
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® SPL (dB) distance, because the ratio between sources, also on the distance of the BINAURAL SYNTHESIS IS direct and reverberant energy source. Spectral cues (SCs), caused PERFORMED BY FILTERING A decreases with increasing source by the reflection or diffraction by SOURCE SIGNAL WITH THE distance [8].
PAGE 97
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Position Delay Computation HRTF Selection HRTF Database HRTF Interpolation Audio Convolution Crossfading Convolution Crossfading Delay Line Acoustic Scene [FIG5] The signal flow of a dynamic binaural synthesis system for multiple sound sources. provide the required hardware for computer-vision-based head tracking for mobile applications.
PAGE 98
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® However, other types of headphones, several advantages. First, it lowers AN ADDITIONAL CONSTRAINT such as closed and IE headphones, the required filter orders of the IN A HEAR-THROUGH SYSTEM block the ear canal and suppress outHRTFs, reducing computational and IS ITS LATENCY, OR THE TIME side sounds. The hear-through mode memory requirements.
PAGE 99
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® the given sequence while the external microphone signal is filtered with an FIR filter having the allpass tail impulse response. ASSISTED LISTENING APPLICATIONS We limit the scope of assisted listening to such applications that help listening in a noisy environment or that employ augmented and modified reality technologies.
PAGE 100
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® system to implement time-varying short-distance navigation and object THE INCREASE IN binaural synthesis were discussed. avoidance in which spatialized sound COMPUTATIONAL POWER Audio-augmented reality mixes can play a crucial role to help people OF MOBILE DEVICES WILL ENABLE real and reproduced sounds, requiradvance safely.
PAGE 101
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Department of Signal Processing and Acoustics at Aalto University, Espoo, Finland. His research interests include sound reproduction, headphone audio, and digital filtering. He was a member of the organizing committee of the 2013 Audio Engineering Society 51st International Conference on Loudspeakers and Headphones, Helsinki, Finland. Hannes Gamper (hannes.gamper@aalto.
PAGE 102
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Kaushik Sunder, Jianjun He, Ee-Leng Tan, and Woon-Seng Gan] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 103
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® convention for today’s digital media original sound scene, as well as the TO ACHIEVE NATURAL is still primarily a channel-based forindividual spectral characteristics of SOUND RENDERING, THE mat. Hence, the focus of this article the listener’s ears.
PAGE 104
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Headphone Listening Signal Processing Techniques Natural Listening Materials for Loudspeaker Playback Virtualization Recordings/Mixtures Sound Scene Decomposition Headphones Equalization Free Air Individual Filtering (Partial Ear) Individualization Individual Filtering (Torso, Head, Ear) Nonadapted Head Movements Head Tracking Head Movements Physical Sound Sourc
PAGE 105
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® in Figure 2(a)] have to be calibrated Furthermore, adding the reverA PERSONALIZED according to the head movements beration of sources (or the loudLISTENING EXPERIENCE (as in natural listening).
PAGE 106
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [TABLE 1] AN OVERVIEW OF TYPICAL TECHNIQUES IN BSS.
PAGE 107
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® potential components of sources [18]. Another technique applied in single-channel BSS is the computational auditory scene analysis (CASA) that simulates the segregation and grouping mechanism of the human auditory system [19] on the modelbased representation (monaural case) of the auditory scenes.
PAGE 108
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [TABLE 2] COMPARISON BETWEEN BSS AND PAE IN SOUND SCENE DECOMPOSITION. BSS OBJECTIVE PAE TO OBTAIN USEFUL INFORMATION ABOUT THE ORIGINAL SOUND SCENE FROM GIVEN MIXTURES AND FACILITATE NATURAL SOUND RENDERING.
PAGE 109
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Subject 131 Sound at Eardrum with Individual Pinna Features Natural Filter (a) Sound Source Individual Parameters HRTF Subject 133 Magnitude Sound Source Sound at Eardrum with Individual Pinna Features HRTF Individualization Process Subject 156 KEMAR 102 103 Frequency (Hz) (c) (b) 104 [FIG3] (a) Human ears act as a natural filter in physical listening.
PAGE 110
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Magnitude (dB/20 μPa) 120 110 100 90 80 70 60 50 102 Frontal Projection Response Frontal HRTFs HighFrequency Cues 103 Frequency (Hz) 104 5 kHz 16 kHz [FIG4] A comparison of the frontal projection headphone response and the frontal directional HRTFs measured on a dummy’s head. (Figure used courtesy of [33].) parameters [30].
PAGE 111
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [TABLE 4] EQUALIZATION TECHNIQUES FOR DIFFERENT PLAYBACK MODES (BINAURAL, STEREOPHONY). MODE OF EQUALIZATION NONDECOUPLED (BINAURAL) DECOUPLED (BINAURAL, STEREOPHONY) TYPES OF EQUALIZATION AND TARGET RESPONSE CHARACTERISTICS SPECTRUM AT EARDRUM IS THE INDIVIDUAL HRTF FEATURES CONVENTIONAL EQUALIZATION (FLAT TARGET RESPONSE) ■ FOR CONVENTIONAL HEADPHONES.
PAGE 112
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® where Z earcanal and Z headphones are the input impedances of the experiments [4], [38] showed that listeners prefer other alternative target responses more than the conventional FF and ear canal and the impedance of the headphone, respectively; DF equalizations.
PAGE 113
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® emitters for source rendering [33], are rendered in a manner so as to IN GENERAL, NATURAL and DF equalization is used to renrecreate a natural sound environSOUND RENDERING REQUIRES der environment signals over all the ment. Modeling the acoustics of the BOTH THE SPATIAL AND emitters.
PAGE 114
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® MOS for Four Measures Scatter Plot for All Scores 100 Preference of the Tracks 100 Natural Sound Rendering Conventional Stereo Natural Sound Rendering Mean Opinion Score 90 80 70 60 80 33% 60 61% 40 6% Prefer Natural Sound Rendering Not Sure Prefer Conventional Stereo 20 0 50 1 2 3 Measure (a) 4 0 50 Conventional Stereo (b) 100 (c) [FIG6] Results of the su
PAGE 115
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® AUTHORS Kaushik Sunder (KAUSHIK1@e.ntu.edu.sg) _______________ received his B.Tech degree in electrical and electronics engineering from the National Institute of Technology Karnataka, Surathkal, India, in 2011. He is currently pursuing his Ph.D. degree in electrical and electronics engineering at Nanyang Technological University, Singapore.
PAGE 116
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Vijay Parsa, João F. Santos, Kathryn Arehart, Oldooz Hazrati, [Tiago H. Falk,Rainer Huber, James M. Kates, and Susan Scollie ] EAR PHOTO—©ISTOCKPHOTO.COM/XRENDER ASSISTED LISTENING SIGN—© ISTOCKPHOTO.
PAGE 117
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® impairment, these subjects can become candidates for HA or CI devices. Recently, a number of factors, such as aging population, enlargement of candidacy criteria, and technological advances have drawn great attention to HA and CI research and development.
PAGE 118
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® where rk is the correlation coefficient between the reference and processed speech envelopes estimated in filter bank channel k (typically, 23 gammatone channels are used), and the [–15], [15] operator refers to the process of limiting and mapping SNR app into that range.
PAGE 119
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® PERCEPTION-MODEL-BASED QUALITY PREDICTION In its original version, the perception-model-based quality prediction method, PEMO-Q, compares the auditory-inspired “internal representation” of the reference speech signal to that of its processed counterpart to objectively characterize the quality of the processed speech signal [16].
PAGE 120
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® bank was replaced by the filter bank used in the speech coding strategy of the CI devices used by the listeners in the subjective test. Second, speech content variability was reduced by means of a modulation spectrum thresholding scheme [27].
PAGE 121
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Multiple Stimuli with Hidden Reference and Anchor (MUSHRA) quality scale, with “20” referring to poor quality and “100” representing excellent quality. Participants selected and listened to the reference and test stimuli and then indicated their quality judgments by adjusting the corresponding sliders on the computer screen.
PAGE 122
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® two metrics, respectively, along with their fitted sigmoidal curves. Table 2, in turn, presents the results obtained with seven intrusive and four nonintrusive measures on NONENHANCED ALL (NOISE/REVERB) ENHANCED the HA nonlinear frequency comt spear t sig f-RMSE t METRIC t t spear t sig f-RMSE t t spear t sig f-RMSE pression quality database. Note that NCM 0.68 0.74 0.87 9.
PAGE 123
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® DISCUSSION Table 4 summarizes the recommendations for metric usage based on distortion condition type (i.e., overall, nonenhanced, enhanced, NFC), assistive device (CI, HA), and the availability or unavailability of a reference signal (intrusive or nonintrusive).
PAGE 124
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 90 100 Not Enhanced Enhanced 80 Not Enhanced Enhanced 80 70 Quality Quality 60 60 40 50 40 30 20 20 10 0 0 0.2 0.4 0.6 0.8 1 0 0 0.2 PESQ (a) 0.4 0.6 0.
PAGE 125
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® latter case). In the scenario where nonlinear speech enhancement (noise suppression and dereverberation) was activated, three measures stood out: HASQI, PEMO-Q, and PEMO-Q-HI.
PAGE 126
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® strategies and the evaluation of signal processing algorithms with the goal of improving speech intelligibility and sound quality. She teaches courses in hearing science and audiology and is a certified clinical audiologist. Oldooz Hazrati (oldooz.hazrati@gmail.com) _________________ received the B.S.E.E. and M.S.E.E.
PAGE 127
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [Tuomas Virtanen, Jort F. Gemmeke, Bhiksha Raj, and Paris Smaragdis] COMPOSITIONAL MODELS for Audio Processing [Uncovering the structure of sound mixtures] M any classes of data are composed as constructive combinations of parts. By constructive combination, we mean additive combination that does not result in subtraction or diminishment of any of the parts.
PAGE 128
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Frequency (kHz) The compositional framework for also the flexibility to use them in THE BASIC PREMISE sound analysis builds upon these ways that are nonstandard in audio UNDERLYING THE APPLICATION impressions: it characterizes the processing.
PAGE 129
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 3 2 1 0 −0.8 0 4 3 2 1 0 −1 5 ICA Atom 1 4 3 2 1 0 0 0.5 −0.2 (a) PCA Activation 1 ICA Atom 2 5 Frequency (kHz) 4 0 4 3 2 1 0 0 0.
PAGE 130
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Time–frequency representations have a fundamental limitation: known that the human auditory system effectively acts as a filter the bandwidth, DF, of the filters, representing the minimum difbank [16] and that the amplitude of a signal is encoded by the nonnegative number of the firings of neurons [17] (even though ference in frequencies that can be resolved is inversely propor
PAGE 131
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® the sum of factors having a fixed spectrum a k and time-varying activation x k [t]. Representing the activation of the kth atom to all of the spectral vectors in Y as a vector x k = [x k [1] x k [2] g x k [T]] <, we can represent the overall contribution of a k to Y as a k x
PAGE 132
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® The various divergences scale differently with their arguments. The squared error scales quadratically, meaning that D SQ (aY | | aAX) = a 2 D SQ (Y | | AX), the IS divergence is scale invariant, i.e., D IS (aY | | aAX) = D IS (Y | | AX), while the KL divergence scales linearly: D KL (aY | | aAX) = aD KL (Y | | AX) .
PAGE 133
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® analysis frame is the probability distribution over k, which specifies how the component multinomials are chosen in any draw.
PAGE 134
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® additional requirement that atoms must be normalized after every iteration. There exists also ways to take the normalization into account in the update, which guarantee that the updates and normalization together decrease the value of the cost function [3], [31]. One of the most common constraints is that of sparsity, e.g., [4], [32], and [33].
PAGE 135
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® A s X *s , AX where the last term is the ratio of the contribution of the sth source to all the sources in each time–frequency point. This filter response is used by the well-known Wiener filter, and the reconstruction is often referred to as the Wiener-style reconstruction. If we wish to listen to these separated components, we need to convert them back to the time domain.
PAGE 136
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Frequency (kHz) Frequency (kHz) and sparsity-based methods both use the fact that atoms can the spectral character of each sound.
PAGE 137
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® e.g., in terms of the separation quality [25], provided that they may produce. A source may produce any number of distinct are appropriately acquired. The downside of larger dictionaries spectral structures. To accommodate all of them, the dictionary is, of course, increased computational complexity. must ideally be large.
PAGE 138
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® associate labels to atoms. When the dictionary is learned from data, x [t] is maximally sparse (contains only one nonzero entry), however, the appropriate mapping from atoms to labels is unclear. (17) is in fact identical to nearest neighbor classification.
PAGE 139
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 0.2x Zero +0.1x Noise +0.09x Zero +0.08x Two +0.08x Noise +... 0.7 0.6 Activation 0.5 0.4 0.3 0.2 0.1 0 0 1 2 3 4 5 6 7 8 9 oh Digit Labels [FIG8] By associating each dictionary atom from Figure 5 with a word label, the linear combination of speech atoms in Figure 5 serves directly as evidence for the underlying word classes.
PAGE 140
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Original Speech 4 Frequency (kHz) 3.5 3 2.5 2 1.5 1 0.5 0.5 1 1.5 Time (s) (a) 2 2.5 Missing Speech Reconstructed Using NMF 4 Missing 3 2.5 2 1.5 Original Frequency (kHz) 3.5 1 0.5 0.5 1 1.5 Time (s) (b) 2 2.5 [FIG9] An example of bandwidth extension of the spoken sequence of digits “nine five oh.” (a) The log-scaled spectrogram of the fullbandwidth signal.
PAGE 141
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® excitations, and a filter is learned to accommodate the dictionary to a new condition. Elementary Filter Atoms AUDIO DEREVERBERATION The excitation-filter model discussed in the previous section is only able to deal with filters whose length is smaller than one audio frame.
PAGE 142
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® are indexed by n, but now also with activations that correspond to the REVERBERATION missing frames. x, which is the frame index of the CAN BE FORMULATED AS A As in the previous example, short-time spectrogram segment. COMPOSITIONAL PROCESS. sounds typically have strong tempoAn illustration of the model is given ral and spectral dependencies. Temin Figure 12.
PAGE 143
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Log−Frequency Index f Observed Spectrogram Y 1,400 1,200 1,000 800 600 400 200 (a) 2,000 Frequency Shift τ 500 1,500 400 300 1,000 200 500 100 0 1 2 3 4 0 10 Amplitude (c) Time (s) (b) Log−Frequency Index f Atom a1 Activations x1, τ [t ] 0 20 [FIG13] An illustration of NMD in frequency.
PAGE 144
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Atoms ak Spectrograms of the Left and Right Channel Frequency (kHz) 4 3 2 1 0 50 40 30 20 10 Amplitude (a) (b) Channel Gains gk, c Activations xk [t ] Gain 4 0.5 Amplitude 6 1 2 0 Left Right 56 34 2 1 Component Index (c) 1 2 3 4 Time (s) (d) [FIG14] The tensor factorization of multichannel audio.
PAGE 145
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Nijmegen on the subject of noise well suited for modeling nonlinear THE ABILITY OF THE MODELS robust automatic speech recogniphenomena. Compositional models TO COUPLE ACOUSTIC AND OTHER tion (ASR) using missing data techuse iterative algorithms for finding TYPES OF INFORMATION ENABLES niques.
PAGE 146
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [11] M. D. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. E. Davies, “Sparse representations in audio & music: From coding to source separation,” Proc. IEEE, vol. 98, no. 6, pp. 995–1005, 2009. [40] A. Lefèvre, F. Bach, and C. Févotte, “Itakura-Saito nonnegative matrix factorization with group sparsity,” in Proc. IEEE Int. Conf.
PAGE 147
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Cichocki, Danilo P. Mandic, [ Andrzej Anh Huy Phan, Cesar F.
PAGE 148
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® concepts are supported by illustrative real-world case studies that highlight the benefits of the tensor framework as efficient and promising tools, inter alia, for modern signal processing, data analysis, and machine-learning applications; moreover, these benefits also extend to vector/matrix data through tensorization.
PAGE 149
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® X(:, :, k) X(1) J ... J ~ = I I X(:, j, :) K ... (SVD/PCA) X(2) Σ U1 I A2 I X(i, :, :) ...
PAGE 150
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® J x (0) x (1) x (2) gN K O K x (1) x (2) x (3) gO H=K O = a b % b, KK x (2) x (3) x (4) gOO h h L h P I = 26 + ··· + (2 × 2 × 2 × 2 × 2 × 2) (8 × 8) (64 × 1) (a) z0 z0 + Δz z0 + 2Δz y y0 + 2Δy y0 + Δy x 2Δ + x 0 Δx + x0 x0 z y0 x (b) [FIG2] Construction of tensors.
PAGE 151
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® λR λ1 X bR b1 ∼ = + ··· + = a1 BT D A aR br ar (I × R ) (I × J ) (R × R ) (R × J ) (a) c1 cR λ1 C λR + ··· + ∼ = b1 bR (K × R ) cr = BT A a1 aR (I × J × k ) (b) ar (R × R × R ) (I × R ) br (R × J ) [FIG3] The analogy between (a) dyadic decompositions and (b) PDs; the Tucker format has a diagonal core.
PAGE 152
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Cramér–Rao induced bound for the assessment of CPD performance were derived in [52] and [53].
PAGE 153
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® TUCKER DECOMPOSITION Figure 4 illustrates the principle of TKD, which treats a tensor X ! R I1 # I2 # g # I N as a multilinear transformation of a (typically dense but small) core tensor G ! R R 1 # R 2 # g # R N by the factor (n) (n) matrices B (n) = [b 1 , b 2 , f, b (Rnn)] ! R I n # R n, n = 1, 2, f, N [3], [4], given by X= R2 R1 RN / /g/ r1 = 1 r2 = 1 rN = 1 g r1 r2
PAGE 154
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® (K ) ~ = (I × J × K ) B1T A1 (K ) c1 + ··· + (I × L1) (L1 × J ) cR BRT AR (LR × J ) (I × LR) (a) C1 ~ = A 1 (I × J × K ) (I × L1) 1 (K × N1) B1T (M1 × J ) CR + ··· + AR R BRT (LR × MR × NR) (b) [FIG5] BTDs find data components that are structurally more complex than the rank-1 terms in CPD. (a) Decomposition into terms with multilinear rank (L r , L r , 1) .
PAGE 155
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q 0.1 0.1 0 0 s1 s1 THE WORLD’S NEWSSTAND® −0.1 −0.1 −0.2 −0.2 −0.3 0.05 0.1 0.15 0.2 0.05 0.1 Time (s) s ŝPCA ŝICA 0.15 0.2 Time (s) ŝCPD s ŝCPD (a) ŝTKD ŝBTD (b) 60 0.3 SAE (dB) 0.2 s2 0.1 0 40 20 −0.1 −0.2 0.05 0.1 0.15 0.
PAGE 156
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Sparse Vector W g y ~ = W(3) ⊗ W(2) ⊗ M3 = 32 = M1 = 585 M2 = 585 W(1) (l1 × l2 × l3) 32 I3 = Φ(3) (M3 × I3) Φ(2)T Measurement Vector (CS) (M1 × M2 × M3) I1 = 1,024 Sparse Vector Representation (Kronecker-CS) I2 = 1,0 24 (I2 × M2) Φ(1) (M1M2M3 × I1I2I3) (M1M2M3) (M1 × I1) (I1I2I3) (a) Vector Representation (a) (1,024 × 1,024 × 32) (256 × 256 × 32) Blo
PAGE 157
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [85]. The Kronecker-CS model has been applied in magnetic resonance imaging, hyperspectral imaging, and in the inpainting of multiway data [86], [84]. APPROACHES WITHOUT FIXED DICTIONARIES In Kronecker-CS, the modewise dictionaries B (n) ! R I n # I n can be chosen so as best to represent the physical properties or prior knowledge about the data.
PAGE 158
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® C(1) (1) ∼ = (1) B(1)T ... A(1) C C(k) (k) (1) ∼ = (k ) BT B(k )T ... A(k) C(K ) (K) (K) (k) A ∼ = A(K ) (K ) B(K )T [FIG10] Efficient computation of CPD and TKD, whereby tensor decompositions are computed in parallel for sampled blocks. These are then merged to obtain the global components A, B, and C, and a core tensor G.
PAGE 159
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® only require a limited, partial exploration of the data matrix. Tucker variants of this approach have been derived in [99]–[101] and are illustrated in Figure 11, while a cross-approximation for the TT format has been derived in [102].
PAGE 160
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® P(2) (I3 × L3) P(2) (I3 × L3) 1 ~ = R + ··· + P(1)T 1 t1 (I1 × I2 × I3) (L2 × I2) (I ) P(1)T tR R (L2 × I2) (I ) P(2) (I3 × RL3) = ... T P(1)T X (I1 × R) (R × RL2 × RL3) (RL2 × I2) Q(2) (J3 × L3) Q(2) (J3 × L3) 1 ~ = u1 (I1 × J2 × J3) R + ··· + (J ) Q(1)T 1 (L2 × J2) Q(1)T uR R (J ) (L2 × J2) Q(2) (J3 × RL3) = U ...
PAGE 161
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Data Acquisition Training Prediction Tensorization Time Motion Capture (Limb Trajectories) X(t ) Y(t ) Coordinates Z(t ) 3 6 4 7 ker 6 4 2 0 1 0 –1 –2 –3 2 1 0 –1 HOPLS Tensor Trajectory HOPLS (Regressions) PLS 20 40 60 Time (s) 80 100 20 40 60 Time (s) 80 100 20 40 60 Time (s) 80 100 Model Parameters 32 31 1 2 ECoG Layout 5 8 11 10 9 14 13 12
PAGE 162
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® B(3) B(3, 1) C B(1) B(1, 1) C I B(2)T ~ = (1) I B(3, 1) C B(1, 1) B(2, 1)T (1) B(2, 1)T I Sample Images from Different and Same Categories Training Data … … C I B(3, K ) BC(1) B(1, K ) (l3 × R3) I Test Sample B(2)T C ~ = (K ) LMWCA Apple B(3) B(3, K ) Common Features for Each Category B(1, K ) (K ) B(2, K )T B(2, K )T I LMWCA (R2 × I2) (l1 × R1)
PAGE 163
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ■ The estimation of the number of components in data and the assessment of their dimensionality would benefit from automation, especially in the presence of noise and outliers. ■ Both new theory and algorithms are needed to further extend the flexibility of tensor models, e.g.
PAGE 164
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [6] R. A. Harshman, “Foundations of the PARAFAC procedure: Models and conditions for an explanatory multimodal factor analysis,” UCLA Working Pap. Phonet., vol. 16, pp. 1–84, 1970. [37] A. Belouchrani, K. Abed-Meraim, J.-F. Cardoso, and É. Moulines, “A blind source separation technique using second-order statistics,” IEEE Trans. Signal Processing, vol. 45, no. 2, pp.
PAGE 165
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [64] M. Sørensen, L. De Lathauwer, P. Comon, S. Icart, and L. Deneire, “Canonical Polyadic Decomposition with orthogonality constraints,” SIAM J. Matrix Anal. Appl., vol. 33, no. 4, pp. 1190–1213, 2012. [65] M. Sørensen and L. De Lathauwer, “Blind signal separation via tensor decomposition with Vandermonde factor: Canonical polyadic decomposition,” IEEE Trans.
PAGE 166
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® [lecture NOTES] Dave Zachariah and Petre Stoica Cramér–Rao Bound Analog of Bayes’ Rule T he estimation of multiple parameters is a common task in signal processing. The Cramér–Rao bound (CRB) sets a statistical lower limit on the resulting errors when estimating parameters from a set of random observations.
PAGE 167
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS Now accepting paper submissions The new / dƌĂŶƐĂĐƚŝŽŶƐ ŽŶ ^ŝŐŶĂů ĂŶĚ /ŶĨŽƌŵĂƚŝŽŶ WƌŽĐĞƐƐŝŶŐ ŽǀĞƌ EĞƚǁŽƌŬƐ publishes high-quality papers that extend the classical notions of processing of signals defined over vector spaces (e.g.
PAGE 168
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [lecture NOTES] continued By applying the Schur determinant formula [8], [11] J a J ab G = J a J b - J ba J a-1 J ab = J ba J b = J b J a - J ab J -b 1 J ba , along with | J -1 | = | J | -1, to (5)–(7), we can now state the CRB analogs of the chain rule (3), CRB ^a, bh = CRB ^a | bh CRB ^ bh (8) and of Bayes’ rule (4), CRB ^ bh CRB ^ah = CRB ^ a | bh .
PAGE 169
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE TRANSACTIONS ON 1(: COMPUTATIONAL IMAGING The new IEEE Transactions on Computational Imaging seeks original manuscripts for publication. This new journal will publish research results where computation plays an integral role in the image formation process.
PAGE 170
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [lecture NOTES] continued As shown in [12], the Fisher information matrix can be decomposed into J i = Jr i + JJ i, where Jr i , shown in the box at the bottom of the page, contains the dominant terms and JJ i contains the remainder, so that J -i 1 - Jr i-1 for large n. Using this approximation we now analyze the bounds for A, B, and C by application of (9).
PAGE 171
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 49th Annual Asilomar Conference on Signals, Systems, and Computers November 8-11, 2015 www.asilomarssc.
PAGE 172
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ERRATA In the article “Image Processing and Analysis for Single-Molecule Localization Microscopy” by B. Rieger et al. in the January 2015 issue of IEEE Signal Processing Magazine [1], the two white circles in the gray boxes in Figure 4 were displaced due to a production error. The correct FIgure 4 appears below.
PAGE 173
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® YO U K N O W YO U R S T U D E N T S N E E D I E E E I N F O R M AT I O N . N O W T H E Y C A N H AV E I T . A N D Y O U C A N A F F O R D I T . IEEE RECOGNIZES THE SPECIAL NEEDS OF SMALLER COLLEGES, and wants students to have access to the information that will put them on the path to career success.
PAGE 174
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [standards IN A NUTSHELL] Siwei Ma, Tiejun Huang, Cliff Reader, and Wen Gao AVS2—Making Video Coding Smarter A VS2 is a new generation of video coding standard developed by the IEEE 1857 Working Group under project 1857.4.
PAGE 175
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE GlobalSIP’15—Call for Papers 2015 IEEE Global Conference on Signal and Information Processing – Orlando, Florida Digital Object Identifier 10.1109/MSP.2015.
PAGE 176
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [standards IN A NUTSHELL] continued Profile was accepted as an option of video codecs for Internet Protocol Television (IPTV) applications by the International Telecommunication Union–Telecommunication Standardization Sector (ITU-T) Focus Group on IPTV standardization [1].
PAGE 177
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® CU Partition 2Nd × 2Nd Split Flag = 0 Split Flag = 1 CU Depth, d=0 N0 = 32 2N0 0 1 2 3 PU Partition PU_Skip /Direct CU0 2N0 Split Flag = 0 Split Flag = 1 CU Depth, d=1 N1 = 16 2N1 0 1 2 3 d=0 d=2 0 1 2 3 2Nd × Nd Nd × 2Nd Nd × Nd 2Nd × nU 2Nd × nD nL × 2Nd nR × 2Nd PU_Inter d=3 Last Depth: No Splitting Flag 2N3 2Nd × 2Nd PU_Intra CU2 2N2 CU De
PAGE 178
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [standards IN A NUTSHELL] continued its location to the reference pixels applying the selected prediction direction. To improve the intraprediction accuracy, the subpixel precision reference samples must be interpolated if the projected reference samples locate on a noninteger position.
PAGE 179
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® backward, biprediction, and symmetric prediction, using two reference frames. In a B frame, in addition to the conventional forward, backward, bidirectional, and skip/direct prediction modes, symmetric prediction is defined as a special biprediction mode, wherein only one forward motion vector (MV) is coded and the backward MV is derived from the forward MV.
PAGE 180
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [standards IN A NUTSHELL] 2N × nU continued 2N × nD nL × 2N nR × 2N PU Others Level 0 2N × 2N 2N × 2N 2N × 2N Split Split Split TU Level 1 2N × 0.5N 0.5N × 2N [FIG6] A PU partition and two-level transform coding.
PAGE 181
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Deblocking filtering aims to remove the blocking artifacts caused by block transform and quantization. The basic unit for the deblocking filter is an 8 # 8 block. For each 8 # 8 block, the deblocking filter is used only if the boundary belongs to either of the CU, PU, or TU boundaries.
PAGE 182
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® continued Main Road 42 41 40 39 38 37 36 35 34 33 32 PSNR (dB) PSNR (dB) [standards IN A NUTSHELL] AVS2 HEVC 0 2,000 4,000 6,000 kb/s (a) 8,000 Over a Bridge 39 38 37 36 35 34 33 32 31 30 29 10,000 12,000 AVS2 HEVC 0 500 1,000 1,500 kb/s (b) 2,000 2,500 3,000 [FIG11] A performance comparison between AVS2 and HEVC for surveillance videos: (a) main road and (
PAGE 183
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® FREE SPS STUDENT MEMBERSHIP FOR 2015 You’re in the beginning stages of your career. Membership in the IEEE Signal Processing Society can help you lay the groundwork for many years of success.
PAGE 184
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [standards IN A NUTSHELL] continued the performance of AVS2 with three different coding configurations AI, RA, and LD, similar to the high-efficiency video coding (HEVC) common test conditions and Bjøntegaard delta bit rate is used for bit rate saving evaluation.
PAGE 185
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [advertisers INDEX] The Advertisers Index contained in this issue is compiled as a service to our readers and advertisers: the publisher is not liable for errors or omissions although every effort is made to ensure its accuracy. Be sure to let our advertisers know you found them through IEEE Signal Processing Magazine. ADVERTISER PAGE URL Asilomar Conference 169 www.
PAGE 186
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [dates AHEAD] Please send calendar submissions to: Dates Ahead, c/o Jessica Barragué IEEE Signal Processing Magazine 445 Hoes Lane Piscataway, NJ 08855 USA e-mail: ___________ j.barrague@ieee.org (Colored conference title indicates SP-sponsored conference.) 2015 [APRIL] Data Compression Conference (DCC) 7–9 April, Snowbird, Utah, United States. URL: http://www.cs.brandeis.
PAGE 187
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® USB & ETHERNET RF SWITCH MATRIX Switch position indicator lights DC to 18 GHz 385 $ from We’re adding more models and more functionality to our line of RF switch matrices. All models now feature switch cycle counting with automatic calibration interval alerts based on actual usage, an industry first! This function improves test reliability and saves you money.
PAGE 188
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Sprechen Sie MATLAB? Modeling electric potential in a quantum dot. Contributed by Kim Young-Sang at HYU. This example available at mathworks.com/ltc ® Image: Kim Young-Sang, Jeong Hee-Jun, Quantum Device Lab, Hanyang Univ. ©2012 The MathWorks, Inc. Over one million people around the world speak MATLAB.
PAGE 189
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE SIGNAL PROCESSING SOCIETY CONTENT GAZETTE [ISSN 2167-5023] MARCH 2015 Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND®
PAGE 190
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ASRU 2015 IEEE Automatic Speech Recognition and Understanding Workshop December 13-17, 2015 Scottsdale, Arizona, USA http://asru2015.org # " " # % ! #" " " !" .
PAGE 191
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ JANUARY 15, 2015 VOLUME 63 NUMBER 2 ITPRED (ISSN 1053-587X) REGULAR PAPERS Separable Beamforming For 3-D Medical Ultrasound Imaging http://dx.doi.org/10.1109/TSP.2014.2371772 ........ ........ ......... ......... .. .. ........ ......... ......... ........ ......... ......... ...... M. Yang, R. Sampson, S. Wei, T. F. Wenisch, and C.
PAGE 192
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Matrix-Monotonic Optimization for MIMO Systems http://dx.doi.org/10.1109/TSP.2014.2373332 . ......... ..... C. Xing, S. Ma, and Y. Zhou 334 Multi-Gb/s Software Decoding of Polar Codes http://dx.doi.org/10.1109/TSP.2014.2371781 ......... ....... B. Le Gal, C. Leroux, and C. Jego 349 Pattern-Coupled Sparse Bayesian Learning for Recovery of Block-Sparse Signals http://dx.doi.
PAGE 193
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ FEBRUARY 1, 2015 VOLUME 63 NUMBER 3 ITPRED (ISSN 1053-587X) REGULAR PAPERS Knowledge-Based Spatial-Temporal Hierarchical MIMO Radar Waveform Design Method for Target Detection in Heterogeneous Clutter Zone http://dx.doi.org/10.1109/TSP.2014.2366714 ..... ... B. Jiu, H. Liu, X. Wang, L. Zhang, Y. Wang, and B.
PAGE 194
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® On Degrees of Freedom Region of Three-User MIMO Interference Channel http://dx.doi.org/10.1109/TSP.2014.2379612 ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ ......... ......... .. L. Yang and W. Zhang 590 Massive MIMO Channel-Aware Decision Fusion http://dx.doi.org/10.1109/TSP.2014.2376886 ..... ....
PAGE 195
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ JANUARY 2015 VOLUME 23 NUMBER 1 ITASFA (ISSN 2329-9290) EDITORIAL Inaugural Editorial: Embracing New Opportunities for Growth http://dx.doi.org/10.1109/TASLP.2015.2390431 ...... ........ ......... .... H. Li 5 REGULAR PAPERS A Regression Approach to Speech Enhancement Based on Deep Neural Networks http://dx.doi.org/10.1109/TASLP.2014.
PAGE 196
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Inversion of Auditory Spectrograms, Traditional Spectrograms, and Other Envelope Representations ..... ......... . ........ ...... R. Decorsière, P. L. Søndergaard, E. N. MacDonald, and T. Dau 8QVXSHUYLVHG 6SHDNHU ,GHQWLÀFDWLRQ LQ 79 %URDGFDVW %DVHG RQ :ULWWHQ 1DPHV http://dx.doi.org/10.1109/TASLP.2014.2367822 .... ......... .. .. ........ ......... ......... ........ ......
PAGE 197
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ FEBRUARY 2015 VOLUME 23 NUMBER 2 ITASFA (ISSN 2329-9290) REGULAR PAPERS Time-Spread Echo-Based Audio Watermarking With Optimized Imperceptibility and Robustness ..... ......... ......... ........ . ........ ......... ........ ...... G. Hua, J. Goh, and V. L. L.
PAGE 198
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Multiple F0 Estimation and Source Clustering of Polyphonic Music Audio Using PLCA and HMRFs http://dx.doi.org/10.1109/TASLP.2014.2387388 ..... ......... ......... ........ ....... .. ......... ........ ......... ......... . V. Arora and L. Behera Resolution Warped Spectral Representation for Low-Delay and Low-Bit-Rate Audio Coder http://dx.doi.org/10.1109/TASLP.2014.2384279 ..
PAGE 199
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 7KH 1LQWK ,((( 6HQVRU $UUD\ DQG 0XOWLFKDQQHO 6LJQDO 3URFHVVLQJ :RUNVKRS WK WK -XO\ 5LR GH -DQHLUR %UD]LO *HQHUDO &KDLUV 5RGULJR & GH /DPDUH 38& 5LR %UD]LO DQG 8QLYHUVLW\ RI
PAGE 200
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® _________________________ JANUARY 2015 VOLUME 24 NUMBER 1 IIPRE4 (ISSN 1057-7149) PAPERS $Q (IÀFLHQW 05) (PEHGGHG /HYHO 6HW 0HWKRG IRU ,PDJH 6HJPHQWDWLRQ http://dx.doi.org/10.1109/TIP.2014.2372615 .... ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ X. Yang, X. Gao, D. Tao, X. Li, and J.
PAGE 201
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Winding Number Constrained Contour Detection http://dx.doi.org/10.1109/TIP.2014.2372636 ..... ......... ....... Y. Ming, H. Li, and X. He Cross-Camera Knowledge Transfer for Multiview People Counting http://dx.doi.org/10.1109/TIP.2014.2363445 ... ........ ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ... N. C. Tang, Y.-Y.
PAGE 202
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Structure-Guided Statistical Textural Distinctiveness for Salient Region Detection in Natural Images ....... ......... ......... ........ ......... ......... . C. Scharfenberger, A. Wong, and D. A. Clausi Random Forest Construction With Robust Semisupervised Node Splitting http://dx.doi.org/10.1109/TIP.2014.2378017 ... ......... ......... .. .. ........ ......... ......... ...
PAGE 203
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ FEBRUARY 2015 VOLUME 24 NUMBER 2 IIPRE4 (ISSN 1057-7149) PAPERS A Probabilistic Approach for Color Correction in Image Mosaicking Applications http://dx.doi.org/10.1109/TIP.2014.2375642 ... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ M. Oliveira, A. D. Sappa, and V.
PAGE 204
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® A Novel SURE-Based Criterion for Parametric PSF Estimation http://dx.doi.org/10.1109/TIP.2014.2380174 ....... ........ F. Xue and T. Blu Learning Templates for Artistic Portrait Lighting Analysis http://dx.doi.org/10.1109/TIP.2014.2369962 .... ......... ........ ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ....
PAGE 205
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING The new IEEE Transactions on Computational Imaging seeks original manuscripts for publication. This new journal will publish research results where computation plays an integral role in the image formation process.
PAGE 206
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ JANUARY 2015 VOLUME 10 NUMBER 1 ITIFA6 (ISSN 1556-6013) EDITORIAL Editorial http://dx.doi.org/10.1109/TIFS.2014.2381777 ....... ......... ........ ......... ......... ........ ......... ......... ........ ......... M. Barni 5 PAPERS Latent Fingerprint Enhancement via Multi-Scale Patch Based Sparse Representation http://dx.doi.org/10.
PAGE 207
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Security Enhancement of Cooperative Single Carrier Systems http://dx.doi.org/10.1109/TIFS.2014.2360437 ........ ........ ......... ......... .. .. ........ ......... ......... ........ ......... ......... ....... L. Wang, K. J. Kim, T. Q. Duong, M. Elkashlan, and H. V. Poor Learning Fingerprint Reconstruction: From Minutiae to Image http://dx.doi.org/10.1109/TIFS.2014.2363951 .
PAGE 208
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________________ FEBRUARY 2015 VOLUME 10 NUMBER 2 ITIFA6 (ISSN 1556-6013) PAPERS Low-Complexity Features for JPEG Steganalysis Using Undecimated DCT http://dx.doi.org/10.1109/TIFS.2014.2364918 ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ ......... ......... V. Holub and J.
PAGE 209
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® /,9( /LJKWZHLJKW ,QWHJULW\ 9HULÀFDWLRQ DQG &RQWHQW $FFHVV &RQWURO IRU 1DPHG 'DWD 1HWZRUNLQJ ...... ......... ......... ........ ......... ....... Q. Li, X. Zhang, Q. Zheng, R. Sandhu, and X. Fu Iris Recognition: What Is Beyond Bit Fragility? http://dx.doi.org/10.1109/TIFS.2014.2371691 ...... ......... .... ..... ........ ...... H.
PAGE 210
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE GlobalSIP'15-Call for Papers 2015 IEEE Global Conference on Signal and Information Processing ± Orlando Florida General Chairs: Jose Moura and Dapeng Oliver Wu Technical Program Chairs: Mihaela van der Schaar, Xiaodong Wang, and Hsiao-Chun Wu The IEEE Global Conference on Signal and Information Processing (GlobalSIP) is a recently launched flagship conference of the IE
PAGE 211
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ______________________________ JANUARY 2015 VOLUME 17 NUMBER 1 ITMUF8 (ISSN 1520-9210) EDITORIAL Message From the Editor-in-Chief http://dx.doi.org/10.1109/TMM.2014.2377871 ... ......... ........ ......... ......... ........ ...... C. W. Chen 1 PAPERS 3-D Audio/Video Processing Spatio-Temporal Video Segmentation of Static Scenes and Its Applications http://dx.doi.
PAGE 212
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Multimedia Search and Retrieval 4XHU\ 'LIÀFXOW\ (VWLPDWLRQ IRU ,PDJH 6HDUFK :LWK 4XHU\ 5HFRQVWUXFWLRQ (UURU http://dx.doi.org/10.1109/TMM.2014.2368714 .... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ ......... ...... X. Tian, Q. Jia, and T.
PAGE 213
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® ___________________ FEBRUARY 2015 VOLUME 9 NUMBER 1 IJSTGY (ISSN 1932-4553) ISSUE ON VISUAL SIGNAL PROCESSING FOR WIRELESS NETWORKS EDITORIAL Introduction to the Issue on Visual Signal Processing for Wireless Networks http://dx.doi.org/10.1109/JSTSP.2014.2355305 ........ ......... .. .. ........ ......... ......... ........ ......... . V. Velisavljević, B.
PAGE 214
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Quality-Control Algorithm for Adaptive Streaming Services Over Wireless Channels http://dx.doi.org/10.1109/JSTSP.2014.2331912 ...... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ ... S. García, J. Cabrera, and N. García Delay-Constrained Video Transmission: Quality-Driven Resource Allocation and Scheduling http://dx.doi.
PAGE 215
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ____________________ FEBRUARY 2015 VOLUME 22 NUMBER 2 ISPLEM (ISSN 1070-9908) LETTERS Stereo Matching with Optimal Local Adaptive Radiometric Compensation http://dx.doi.org/10.1109/LSP.2014.2350028 .. ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ..... L. Xu, O. C. Au, W. Sun, L. Fang, F. Zou, and J.
PAGE 216
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Change Detection with Compressive Measurements http://dx.doi.org/10.1109/LSP.2014.2352116 .. ......... ......... ........ ........ G. K. Atia Inverse Beamforming for Radio Tomography http://dx.doi.org/10.1109/LSP.2014.2353216 . ........ ......... .. ....... ........ ..... R. K. Martin Joint Object Segmentation and Depth Upsampling http://dx.doi.org/10.1109/LSP.2014.2352715 ....
PAGE 217
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ____________________ MARCH 2015 VOLUME 22 NUMBER 3 ISPLEM (ISSN 1070-9908) LETTERS Ü Bayram Penalty Functions Derived From Monotone Mappings http://dx.doi.org/10.1109/LSP.2014.2357681 ........ .......... ........ ........ I. A Fast Maneuvering Target Motion Parameters Estimation Algorithm Based on ACCF http://dx.doi.org/10.1109/LSP.2014.2358230 ...... .. .. ........ ....
PAGE 218
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Weakly Supervised Semantic Segmentation with a Multiscale Model http://dx.doi.org/10.1109/LSP.2014.2358562 . . .. S. Wang and Y. Wang Speeding Up Graph Regularized Sparse Coding by Dual Gradient Ascent http://dx.doi.org/10.1109/LSP.2014.2358853 ... ......... ......... .. .. ........ ......... ......... ........ ......... ......... ........ ......... ......... ........ ........
PAGE 219
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Xiamen, China, October 19 – October 21, 2015 http://www.mmsp2015.org Tentative Call for Papers General Chairs Xiao-Ping Zhang – Ryerson U, Canada Oscar C. Au – HKUST, Hong Kong MMSP 2015 is the 17th International Workshop on Multimedia Signal Processing.
PAGE 220
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [VOLUME 32 NUMBER 2 MARCH 2015] www.signalprocessingsociety.
PAGE 221
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® [CONTENTS] [VOLUME 32 NUMBER 2] [SPECIAL SECTION—SIGNAL PROCESSING TECHNIQUES FOR ASSISTED LISTENING] 16 FROM THE GUEST EDITORS Sven Nordholm, Walter Kellermann, Simon Doclo, Vesa Välimäki, Shoji Makino, and John R.
PAGE 222
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® FREE SPS STUDENT MEMBERSHIP FOR 2015 You’re in the beginning stages of your career. Membership in the IEEE Signal Processing Society can help you lay the groundwork for many years of success.
PAGE 223
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _________________ ___________________ __________ _____________________ _____________ _______________ ___________ ________________ ___________________ www.signalprocessingsociety.
PAGE 224
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® 25'(5 )250 )25 5(35,176 3XUFKDVLQJ ,((( 3DSHUV LQ 3ULQW LV HDV\ FRVW HIIHFWLYH DQG TXLFN &RPSOHWH WKLV IRUP VHQG YLD RXU VHFXUH ID[ KRXUV D GD\ WR RU PDLO LW EDFN WR XV 3/($6( ),// 287 7+( )2//2:,1* $XWKRU 5(7851 7+,6 )250 72 ,((( 3XEOLVKLQJ 6HUYLFHV 3XEOLFDWLRQ
PAGE 225
M q M q M q Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page MqM q THE WORLD’S NEWSSTAND® Start your membership immediately: Join online www.ieee.org/join 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. Please PRINT your name as you want it to appear on your membership card and IEEE correspondence. As a key identifier for the IEEE database, circle your last/surname.
PAGE 226
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Information for Authors (Updated/Effective September 17, 2014) The IEEE TRANSACTIONS ON SIGNAL PROCESSING is published online twice per month (semimonthly) covering advances in the theory and application of signal processing. The scope is reÀected in the EDICS: the Editor’s Information and Classi¿cation Scheme.
PAGE 227
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Author Misconduct Procedures: The procedures that will be used by the Signal Processing Society in the investigation of author misconduct allegations are described in the IEEE SPS Policies and Procedures Manual.
PAGE 228
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS Now accepting paper submissions The new / dƌĂŶƐĂĐƚŝŽŶƐ ŽŶ ^ŝŐŶĂů ĂŶĚ /ŶĨŽƌŵĂƚŝŽŶ WƌŽĐĞƐƐŝŶŐ ŽǀĞƌ EĞƚǁŽƌŬƐ publishes high-quality papers that extend the classical notions of processing of signals defined over vector spaces (e.g.
PAGE 229
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® _____________ www.signalprocessingsociety.
PAGE 230
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® ______________ Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND®
PAGE 231
Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND® Previous Page | Contents | Zoom in | Zoom out | Front Cover | Search Issue | Next Page M q M q M q MqM q THE WORLD’S NEWSSTAND®