[Eeglablist] ICA "adds" noise?

Thu Jan 24 09:49:12 PST 2013

Hi all,

Since I'm using a BioSemi system, there is no reference electrode to 
begin with (save an implicit one approximately between the locations of 
CMS and DRL, I guess), and the data must always be re-referenced. There 
is no blank channel to include when calculating the average reference - 
or something to reconstruct with any other reference - so I guess by 
default you're dropping the implicit reference channel. Anyway, here's 
me going through a few steps with some of our data, with 64 scalp 
channels, 4 EOG channels, and 2 earlobe channels:

on import without any reference at all:
rank(double(EEG.data)) = 70

re-reference to average:
rank(double(EEG.data)) = 70

re-reference from average or from no reference to the average of the 
earlobes:
rank(double(EEG.data)) = 68

re-reference to average from the earlobes, without reconstructing 
reference channel:
rank(double(EEG.data)) = 68

re-reference to average from the earlobes, reconstructing reference channel:
rank(double(EEG.data)) = 69
(reduced from 70 because the earlobes are now the same)

So, with BioSemi, re-referencing straight to the average reference 
doesn't seem to reduce rank, and there's no way to keep a reference 
channel by default; however, the last value of the svd of the avg 
referenced data is 0...

I'd also note that the above ranks are calculated with matlab's rank 
function; if you use the alternative method included in pop_runica, the 
ranks it reports for the above data are 70, 69, 67, 67, and 68 
respectively, which are more in keeping with my naive expectations of 
how re-referencing would affect rank - numChans-1 (or 2). This is the 
source of the error message I mentioned in an earlier email, where 
EEGlab overrules the latter method and uses only the matlab rank 
command's estimates.

But to get back to what started it all, I guess Kristina's problem was 
solved already by Joe and Makoto - if you've referenced to multiple 
channels, and leave both in the data, you need to take one out or you 
may get the kind of duplicate components she saw, whereas my case was a 
little different, perhaps caused by undetected bridged electrodes. So 
it's not caused by any rank reduction introduced by (re-)referencing 
*per se* but by keeping multiple dependent channels in the data (of 
which a symptom would be reduced rank?), and the best thing is probably 
just to remove one of those channels. Is that a reasonable summary?

Anyway, thanks a lot for the discussion, been illuminating even if I've 
ended up overcomplicating it!

Cheers,
Matt

On 24/01/2013 02:45, Jason Palmer wrote:
> Hi Joseph et al.,
>
> I believe that any reference, average or channel, does reduce the rank
> by one. This is straightforward to show using linear algebra, e.g., here:
>
> http://sccn.ucsd.edu/wiki/Linear_Representations_and_Basis_Vectors#EEG_Data_Reference_and_Re-referencing
>
> The rank that matlab gives you depends on the tolerance used to declare
> small dimensions zero. E.g. a rank deficient matrix usually has smallest
> eigenvalue of around 1e-15 to 1e-8, due to numerical imprecision,
> particularly with large ill-conditioned matrices. You should see a
> sudden “drop off” though in the eigenvalue magnitudes after the
> theoretical rank.
>
> Best,
>
> Jason
>
> *From:*eeglablist-bounces at sccn.ucsd.edu
> [mailto:eeglablist-bounces at sccn.ucsd.edu] *On Behalf Of *Joseph Dien
> *Sent:* Wednesday, January 23, 2013 11:54 AM
> *To:* Matt Craddock
> *Cc:* eeglablist at sccn.ucsd.edu; Kristina Borgström
> *Subject:* Re: [Eeglablist] ICA "adds" noise?
>
> Hmmm…. I should know better than to talk off the top of my head like that…
>
> Well, part right, part wrong.
>
> It's easy enough to just try it out and see what Matlab says the rank is
> of the data.  I took some Cz-referenced 129-channel data and then
> rereferenced it to mean mastoid and to average reference.
>
>  >> rank(undata)
>
> ans =
>
>     128
>
>  >> rank(mmdata)
>
> ans =
>
>     128
>
>  >> rank(ardata)
>
> ans =
>
>     128
>
> so the bottom line is that average reference doesn't reduce the rank but
> neither does mean mastoid (my error).
>
> As for the wiki page you linked to, it's worded in a confusing way.
>   It's not that "the average reference reduces the rank of the data"
> necessarily.  What it should say is that it doesn't increase the rank of
> the data.  So if you start off with 128 recording channels and a
> reference channel (n=129) and rank is 128 (because voltage data is
> relative by definition so two channels only give you one waveform), then
> after average reference, the rank is still 128 even though it now looks
> as though you've got 129 channels with independent waveforms.  If you
> dropped the 129th reference channel and then computed an average
> reference channel, you would indeed lose another rank (as seen below)
> but that would be because you were doing the procedure incorrectly and
> had deleted a channel of meaningful information (even though it is
> flat).  The flat reference channel should always be included in the
> average reference computation.  The same goes for computing the mean
> mastoid reference (or any other rereference), although unfortunately a
> lot of systems throw that information away.
>
>  >> rank(ar128data)
>
> ans =
>
>     127
>
> I definitely need to look into the effects of bridging more closely,
> especially for frequency-based applications.  This has been a very
> helpful discussion!
>
> Joe
>

-- 
Dr. Matt Craddock

Post-doctoral researcher,
Institute of Psychology,
University of Leipzig,
Seeburgstr. 14-20,
04103 Leipzig, Germany
Phone: +49 341 973 95 44