A simple FastICA example

Posted on 2009-11-22 by Endolith

Wikipedia describes independent component analysis as “a computational method for separating a multivariate signal into additive subcomponents supposing the mutual statistical independence of the non-Gaussian source signals”. (Clearly, this was written as part of their campaign to make technical articles accessible.)

In normal people words, ICA is a form of blind source separation — a method of unmixing signals after they have been mixed together, without knowing exactly how they were mixed. It’s not as bad as Wikipedia makes it sound. It’s just the signal processing equivalent of this:

One of the problems I always have with learning stuff like this is the lack of clear examples. They exist, but they’re not generally very good. (And why do researchers always work with awful noisy 3-second 8 kHz recordings?) So, upon getting working results, I wrote up this little example. This is in Python and requires the MDP (python-mdp in Ubuntu) and Audiolab packages (sudo easy_install scikits.audiolab).

In order for ICA to work, it requires at least one different recording for each signal you want to unmix. So if you have two musical instruments playing together in a room, and want to unmix them to get separate recordings of each individual instrument, you’ll need two different recordings of the mixture to work with (like a stereo microphone). If you have three instruments playing together, you’ll need three microphones to separate out all three original signals, etc. So, first, create the mix:

Find or make two mono sound files. I just used clips of music.
Mix them together to a stereo track, with both sounds mixed into both channels, but with each panned a little differently, so the two channels are not identical. They should sound all jumbled together, but the left channel should sound slightly different from the right.
Save in a format that libsndfile can read, like FLAC or WAV (not mp3):
- Mixed music
- [audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Mixed-NIN-and-Mazzy-Star.mp3]

Alternatively, just mix them in Python:

sig1, fs1, enc1 = wavread('file1.wav')
sig2, fs2, enc2 = wavread('file2.wav')
mixed1 = sig1 + 0.5 * sig2
mixed2 = sig2 + 0.6 * sig1

So now you have the mixed signals, and you can pretend you don’t know how they were mixed. To unmix them automatically, run something like this in Python:

from mdp import fastica
from scikits.audiolab import flacread, flacwrite
from numpy import abs, max

# Load in the stereo file
recording, fs, enc = flacread('mix.flac')

# Perform FastICA algorithm on the two channels
sources = fastica(recording)

# The output levels of this algorithm are arbitrary, so normalize them to 1.0.
sources /= max(abs(sources), axis = 0)

# Write back to a file
flacwrite(sources, 'sources.flac', fs, enc)

The output has each signal in its own channel:

Demixed music

You can hear some crosstalk, but it’s pretty good:

[audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Unmixed-Mazzy.mp3]
[audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Unmixed-NIN.mp3]

For more than two sources, I just read them in separately and combined them in Python:

rec1, fs, enc = flacread('Mixdown (1).flac') # Mono file
rec2, fs, enc = flacread('Mixdown (2).flac')
rec3, fs, enc = flacread('Mixdown (3).flac')

sources = fastica(array([rec1,rec2,rec3]).transpose())

flacwrite() has no problem writing multi-channel files.

Mixed speech:

[audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Mix.mp3]

After demixing, there’s very little crosstalk, though the noise floor increases considerably. This seems to be the case when the mixes are very similar:

[audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Source-1.mp3] [audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Source-2.mp3] [audio:http://www.endolith.com/wordpress/wp-content/uploads/2009/11/Source-3.mp3]

Although this method was recommended to me for real-life audio signals and microphones, as I’ve described above, it turns out that ICA doesn’t actually work well when the signals occur at different delays in the different sensor channels; it assumes instantaneous mixing (that the signals are in perfect sync with each other in all the different recordings). Delay would happen in a real-life situation with performers and microphones, since each source is a different distance from each microphone. This is exactly the application I had in mind, though, so I don’t really have any further interest in ICA…

48 thoughts on “A simple FastICA example”

Puneet Mishra on 2010-03-30 at 8:20 am said:

AS you said that it unmixes the signal into independent statistical way only when we have two sources to mix them up.
Can’t this unmix a EEG signal which contains ECG or EMG artefacts in it.
I searched alot many IEEE papers but no one illustrated well about the proper way to soplve this problem.Kindly publish another example while showing that ECG OR EMG artefact removal from EEG data.
Thanks!

Reply ↓
- Endolith on 2010-03-30 at 9:01 am said:
  
  I’d imagine that taking multiple channels of EEG and then demixing them with ICA would separate out the different parts you want, but I’m not an expert on this stuff, sorry. It might not be the right kind of problem for ICA to solve. Search for “blind source separation” instead of ICA to see if there are better algorithms.
  
  Reply ↓
Matt Gattis on 2010-05-31 at 9:49 pm said:

I was just trying out some similar examples of using fastica and I found your site. Totally agree about the lack of examples.

Anyway, the thing I’m wondering is why you need N channels to produce N separated signals. Why can’t ICA just work on one signal and separate out the components? Seems a bit roundabout to have to copy a second channel and then pan it.

I want to build an app that takes in a song and separates out three signals (vocals, melody, and percussion). It would be very useful to DJs because pretty much no such software exists currently.

Reply ↓
Endolith on 2010-05-31 at 10:06 pm said:

How would it know what the two signals were if it only had one to work with? It uses the difference between the two mixed signals to figure out what the two original signals were.

Reply ↓
Matt Gattis on 2010-05-31 at 10:34 pm said:

I guess I should read more about it… I thought by doing a form PCA it was separating out frequencies that covaried in amplitude together. I didn’t realize that its doing some kind of comparison between sensors.

Its definitely possible to do this with just one signal, but maybe ICA is not what I’m looking for. I would suppose it would work in the same way your brain separates out a conversation from a bunch of people talking in the same room (granted you have two ears to sample from but I’m pretty sure you would still have the ability if you were deaf in one ear).

Reply ↓
Matt Gattis on 2010-05-31 at 10:40 pm said:

lol I just tried this with Biggie’s “Juicy” and it separates Puff Daddy saying “uh uh yea thats right” really annoyingly into one channel and everything else into the other channel. It sounds hilarious.

Reply ↓
Endolith on 2010-06-01 at 12:01 pm said:

Well, in order to extract two signals from one signal, you need a model of what type of signal to expect. If one signal is all low frequencies and the other all high frequencies, you could separate them with a simple filter, for instance. But if you don’t know anything specific about the signals, you’re not going to be able to separate them.

Yes, you could do this the way humans do it. All you’d have to do is write software to simulate a human brain. I’d be very interested in getting a copy of this, if you do it. 😀

Reply ↓
Jeremy on 2011-11-30 at 9:44 pm said:

Hi, I am trying to use your code in a real-world recording. I use two mono microphone to record simultaneously into a stereo mic in.

So the left channel of the audio file now contains the signal from mic 1 and the right channel from mic 2.

I want to ask how should I prepare the file so that “Mix them together to a stereo track, with both sounds mixed into both channels, but with each panned a little differently”?

Thanks.

Reply ↓
- Endolith on 2011-12-01 at 1:41 am said:
  
  I think I used Adobe Audition and mixed the two tracks together to a stereo mix, with each panned differently. Then Left and Right both contain both signals, but not at the same levels.
  
  Reply ↓
Jeremy on 2011-12-01 at 1:06 pm said:

What does it mean by “with each panned differently”?

Do you think it should work on real-world recording with one microphone for each channel?

Reply ↓
- Endolith on 2011-12-01 at 1:12 pm said:
  
  I mean, for instance, that Mic 1 is 100% in Left channel, and 50% in Right channel, while Mic 2 is 100% in Right channel, and 50% in Left channel. You need both signals in both channels, but not at the same level.
  
  Yes it should work fine for real-world microphone recordings, as long as they were recorded independently and mixed without any delay. Mixing them like this is not realistic, though. In real life, if you are recording 2 sources with 2 microphones at the same time, there will be slight delay differences between the microphones, which ICA does not handle as well.
  
  Reply ↓
Jeremy Salwen on 2012-04-01 at 1:53 am said:

Don’t give up so fast: some modified ICA algorithms that do work

Reply ↓
Endolith on 2012-04-04 at 8:55 pm said:

I’ve since discovered the DUET algorithm, which seems very similar to my original idea: Breaking up the two signals with STFT and comparing phase and amplitude differences to guess at their origin in space, and then cluster nearby points and reconstruct the signal from only those STFT components.

Reply ↓
Billy Chan on 2012-05-12 at 3:00 am said:

Hi,
Your demonstration here is just great ,I really appreciate it. Thanks a lot!
But a simple problem arise when I try to use fastica in Matlab. here is the code;

>> w1=wavread(‘C:\Users\BillyChan\Desktop\1.wav’);
>> w2=wavread(‘C:\Users\BillyChan\Desktop\2.wav’);
>> mix1=0.8*w1+0.2*w1;
>> mix2=0.8*w1+0.2*w1;
>> test=mix1′;
>> test=[test ; mix2′];
>> [icasig, A, W]=fastica(test,’numOfIC’,2);

Here what I want to do is to separate the two independent signal,i.e w1 and w2, from the mixed-signal,just as what you did in this demonstration. But what i really got in icasig is only one signal, not two signal. What really happened? Am I doing the right thing? I would really appreciate it if you can give me some hints.
Thanks a lot!

Reply ↓
- Endolith on 2012-05-12 at 11:02 am said:
  
  Well, as you wrote it, mix1 and mix2 are both identical to w1, so there’s nothing to separate. It should be something like mix1=0.8*w1+0.2*w2; mix2=0.1*w1+0.7*w2; Both sources should be in both mixes, but at different levels.
  
  Reply ↓
Billy Chan on 2012-05-13 at 7:08 am said:

Oops…I made a mistake in the code. >_<
Thanks a lot!

Reply ↓
Pera on 2013-10-20 at 7:21 pm said:

Hello;
I’am a PHD student in computer science, and i’m working on the Blind source separation, i read abot SOBI algorithm, but i have somme difficulties to understand it, please if you know this algorithm , can you help me to know how can i get just the covariance matrix please ?

Cordially

Reply ↓
Pera on 2013-10-20 at 7:26 pm said:

Just this question

Reply ↓
Ricardo_mgg on 2013-11-18 at 8:07 am said:

Hi, we are working in matlab with a fast ica code(similar to the one that Billy Chan used, I suppose), and we’re having some problems with the output signal.
[mistura,Fs, N]=wavread(‘C:\User\Downloaded\Mixed_NIN_and_Mazzy_Star_converted.wav’);
[icasig] = fastica (mistura’, ‘numofIC’,2);
…
wavwrite(icasig(1,:)’,’C:\User\Downloaded\sinal.wav’);
wavwrite(icasig(2,:)’,’C:\User\Downloaded\sinal2.wav’);
our output signal is something not similar to any of the expected output.And we get the following warnings:
Warning: Data clipped during write to file:sinal2.wav
> In wavwrite>PCM_Quantize at 280
In wavwrite>write_wavedat at 302
In wavwrite at 139
Do you know what’s the problem?
Thanks a lot!

Reply ↓
- Endolith on 2013-11-20 at 9:13 pm said:
  
  It says “data clipped”, so I would guess that your data is clipped. 🙂 Did you normalize the signal level before writing to the wav file?
  
  Reply ↓
anusha on 2014-02-27 at 1:18 pm said:

hi every one..
I am doing my final year project on FASTICA algorithm. can any one please give me the program for FASTICA in matlab?
it would be very helpful…
Thank you

Reply ↓
Andrew on 2014-03-30 at 2:40 pm said:

Hi,
I’ve managed to get working code for ICA on Matlab, but what would be the main alteration from ICA to PCA, or preferably does anyone have example code for PCA, so I can compare the two?

Many thanks.

Reply ↓
Wonwo Park on 2014-04-02 at 3:51 am said:

Hi, Thank you for your good information.
The Video is wonderful Concept to me.

I have only one mixed signal.
But Is it possible to ICA Analysis, in that case ?

So, Can I separate the original signal ??

I hope so. ^^;

Thanks in advance.

Reply ↓
- Endolith on 2014-04-05 at 8:55 pm said:
  
  No, you need more than one mix of the signal for ICA.
  
  Reply ↓
Enno de Lange on 2014-04-16 at 3:28 am said:

Great example! I use (an adapted version of) it in a signal processing class I teach. Just reading through the comments the first one from Puneet Mishra struck me as pretty hilarious at first (the guy pretty much asks the author to do his PhD for him), but then I realized some of the people reading this may not know that the subject of “unmixing signals” is actually a very nontrivial problem. Please be aware that all of this is very much an active area of research and much is unknown.

Basically, ICA, PCA and all other linear techniques only work in very specific, often artificial, examples. Everybody knows for instance that you cannot “unmix” your milk from your coffee by stirring backwards. The reason it works in the youtube video above is that they have a very viscous fluid (the video mentions that the Reynold number is < 1 and they have a laminar flow–laminar flow is the fluid dynamics term for linearity).

This is also the reason that the demo above works, but that it does not work for the practical purpose the author had in mind (unmixing real world recordings).

As for Puneet's question: EEG is an incredibly complex multivariate recording of a nonlinear source with unknown (high) dimensionality. How ECG and EMG artifacts mix into those recordings is very complex, nonlinear and largely unknown. The artifacts can perhaps be filtered out (partially) by taking into account the specific time and frequency characteristics ("patterns") of the ECG and EMG, but that will require advanced, custom-made algorithms. Out-of-the-box ICA will definitely not work.

Reply ↓
janeth on 2014-09-30 at 6:31 pm said:

Hi, How to get the array w in the algorithm in python FasTICANode…….i want to know the matriz W but i dont kwon how this array in python. Thanks

Reply ↓
sichangi on 2015-04-22 at 11:24 am said:

Hi, how can one use ICA for signal denoising and dimension reduction .Thanks

Reply ↓
sichangi on 2015-04-22 at 11:28 am said:

Hi,how can one use ICA for signal denoising and dimension reduction.Please help on how to write the code in matlab.I am a new user

Reply ↓
Viral on 2015-12-02 at 2:28 am said:

Really informative piece.
I have implemented ICA algorithm using maximum likelihood estimate. I used your provided .flac file and was able to separate the sources.

I have following questions for you.
1) How did you generate the .flac/wav file. I tried doing so by getting 2 mp3 files and converting them to .wav file. But each of the .wav file had multiple channels and they weren’t of same length. I guess we can clip the length but will it be fine to just pick one of the many channels?.

Reply ↓
- Endolith on 2015-12-10 at 8:48 pm said:
  
  It says how I made it in the text right above it.
  
  Find or make two mono sound files. I just used clips of music.
  Mix them together to a stereo track, with both sounds mixed into both channels, but with each panned a little differently, so the two channels are not identical. They should sound all jumbled together, but the left channel should sound slightly different from the right.
  
  You can also just mix them in your matlab/python software, as I also described above it.
  
  Reply ↓
Bruno R. de Oliveira on 2015-12-26 at 7:18 am said:

Has other algorithm which uses second order statistics: AMUSE.

Here have your implementation in Python: http://dspandmath.blogspot.com.br/2015/12/blind-source-separation-with-python.html

Reply ↓
Sriranjan on 2016-04-16 at 10:32 am said:

I am trying to get your code running with my audio files. But I get an error saying
”
wavwrite(np.array([mixed1, mixed2]).T, ‘mixed.wav’,fs2, enc2,dtype=object)
ValueError: setting an array element with a sequence.
”

The code works fine with your inputs.I recorded two audio files simultaneously from two mics but I get this error. Why is it throwing an error with my audio files but not yours. The file formats are the same fs = 44100 and enc = pcm16. Please let me know.

Reply ↓
- Endolith on 2016-04-16 at 11:19 am said:
  
  why does it say dtype=object?
  
  post all of your code
  
  Reply ↓
Sriranjan on 2016-04-16 at 10:40 am said:

The only difference is that the files are slightly of different sizes. One is 1,701 kb and the other is 1,696 kb. How will I overcome this.

Reply ↓
Sriranjan on 2016-04-17 at 1:56 am said:

That error got solved when I used an WAV cutter and cut both the audio wav files to the same size. I put that dtype=object by mistake. The code runs fine without it. But now I am using the wave files generated by your code and I get the following error.
”
File “C:\Python26\fastica.py”, line 21, in
wavwrite(np2.array([mixed1, mixed2]).T, ‘mixed3.wav’,fs1,enc1)
File “C:\Python26\lib\site-packages\scikits\audiolab\pysndfile\matapi.py”, line 47, in basic_writer
hdl = Sndfile(filename, ‘w’, uformat, nc, fs)
UnboundLocalError: local variable ‘nc’ referenced before assignment
“

Reply ↓
- Endolith on 2016-04-18 at 10:23 pm said:
  
  I don’t know what that nc error is, but I’ve been using PySoundFile lately, I always had issues getting scikits.audiolab to work reliably.
  
  Reply ↓
Sriranjan on 2016-04-17 at 2:02 am said:

Full code here:

from mdp import fastica from scikits.audiolab import wavread, wavwrite from numpy import abs, max from array import array import numpy as np2 import time
start_time = time.time() nc = 0 sig1, fs1, enc1 = wavread('mixed1 1_2.wav') print(fs1) print(enc1) sig2, fs2, enc2 = wavread('mixed2 1_2.wav') print(fs2) print(enc2) mixed1 = sig1 mixed2 = sig2 #mixed = np.array([mixed1,mixed2]).T #mixed = mixed.tolist() #wavwrite(mixed, 'mixed.wav',fs2, enc2) wavwrite(np2.array([mixed1, mixed2]).T, 'mixed3.wav',fs1,enc1) #wavwrite(np.array([mixed1, mixed2]).T, 'mixed.wav',dtype = list) # Load in the stereo file recording, fs, enc = wavread('mixed3.wav') # Perform FastICA algorithm on the two channels sources = fastica(recording) # The output levels of this algorithm are arbitrary, so normalize them to 1.0. sources /= max(abs(sources), axis = 0) # Write back to a file wavwrite(sources, 'sources.wav', fs, enc)
print("%s seconds"%(time.time()-start_time))

Reply ↓
Sriranjan on 2016-04-17 at 2:02 am said:

I can mail you my audio files. My email id is infibit@gmail.com

Reply ↓
Vamsi on 2016-04-18 at 9:41 pm said:

Hey @Enno and @Endolith:
As you said that delays are different when you’re trying to do FASTICA on a real world clip and the algorithm assumes the two rec to be in a perfect sync, hence it doesnt work. Then, how come the audio clippings you’re using are in perfect sync (as they’re also recordings after all)… I am a bit confused.

Reply ↓
- Endolith on 2016-04-18 at 9:55 pm said:
  
  I’m taking 2 different monaural recordings and then making 2 different monaural mixes of them. So the mixes contain the same material, in sync, but at different levels. The only difference is the amount of each in the mix.
  
  Reply ↓
Vamsi on 2016-04-18 at 10:02 pm said:

@Endolith
Thanks for such a quick reply…Again to clarify what is actually in sync here?

Reply ↓
- Endolith on 2016-04-18 at 10:24 pm said:
  
  imagine that signal 1 is a sine wave that starts at 0 and signal 2 is an impulse at 5 seconds. mix 1 is 1/2 times the sine wave + 2 * the impulse. mix 2 is 3 * the sine wave + 1 times the impulse. both mixes have the impulse at 5 seconds, but at different amplitudes, and both mixes have the sine wave starting at 0, but at different amplitudes.
  
  Reply ↓
  - Vamsi on 2016-04-18 at 10:53 pm said:
    
    Ohk…I am slowly getting the idea. Will I be able to do fastICA on two separate voice recordings and then mixing them together?
    
    Reply ↓
  - Vamsi on 2016-04-18 at 11:17 pm said:
    
    And I had one more doubt, FastICA is giving only one output, what do I do if I want to get the other one?
    
    Reply ↓
janeth on 2017-05-11 at 1:11 am said:

Friends, are you Know the algorithm FastIcaNode for python?. I used this but I still not understand some things, for example, if this code needs the Signal whitening, what is the meaning of signal whitening?

Reply ↓
Aj on 2018-04-23 at 8:55 am said:

https://arxiv.org/abs/1404.2986

Read this Janet

Reply ↓
Pingback: 语音识别研究的四大前沿方向 - 算法网
Pingback: 语音识别中的鸡尾酒会问题 – 源码巴士

nothing to see here

move along

A simple FastICA example

48 thoughts on “A simple FastICA example”

Leave a Reply to Ricardo_mgg Cancel reply