combining 3 nights data through factor #194

Open
soumyajitmandal opened this issue Feb 24, 2017 · 59 comments
@soumyajitmandal

soumyajitmandal commented Feb 24, 2017

Hi everyone,

I am combining 3 nights of data (full subbands) with Factor. It was going fine until the first amplitude calibration; interpolating them at different times probably flagged all the data. Here is the message:

2017-02-23 18:20:23 WARNING facetselfcal_facet_patch_809.executable_args:   from _parmdb import ParmDB
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:227: RuntimeWarning: overflow encountered in power
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   amp = 10**amp
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:235: RuntimeWarning: overflow encountered in power
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   return amp_clean, 10**(model[ndata:ndata + ndata]), noisevec[ndata:ndata + ndata], scatter, n_knots, idxbad, weights[ndata:ndata + ndata]
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:461: RuntimeWarning: overflow encountered in square
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   channel_parms_imag[chan]**2) for chan in range(nchans)])
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:273: RuntimeWarning: divide by zero encountered in log10
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   amp = numpy.log10(amp)
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:78: RuntimeWarning: invalid value encountered in subtract
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   scatter = numpy.median(abs(shifted_vec - datavector))
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /software/rhel7/lib64/python2.7/site-packages/numpy/lib/function_base.py:3569: RuntimeWarning: Invalid value encountered in median
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   RuntimeWarning)
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:286: RuntimeWarning: invalid value encountered in greater
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   idxbad = numpy.where((numpy.abs(amp - amp_median)) > scatter*3.)
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:287: RuntimeWarning: invalid value encountered in multiply
2017-02-23 18:21:00 WARNING facetselfcal_facet_patch_809.executable_args:   baddata = numpy.copy(amp)*0.0
2017-02-23 18:21:48 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:227: RuntimeWarning: overflow encountered in power
2017-02-23 18:21:48 WARNING facetselfcal_facet_patch_809.executable_args:   amp = 10**amp
2017-02-23 18:21:48 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:235: RuntimeWarning: overflow encountered in power
2017-02-23 18:21:48 WARNING facetselfcal_facet_patch_809.executable_args:   return amp_clean, 10**(model[ndata:ndata + ndata]), noisevec[ndata:ndata + ndata], scatter, n_knots, idxbad, weights[ndata:ndata + ndata]
2017-02-23 18:23:07 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:227: RuntimeWarning: overflow encountered in power
2017-02-23 18:23:07 WARNING facetselfcal_facet_patch_809.executable_args:   amp = 10**amp
2017-02-23 18:23:07 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:235: RuntimeWarning: overflow encountered in power
2017-02-23 18:23:07 WARNING facetselfcal_facet_patch_809.executable_args:   return amp_clean, 10**(model[ndata:ndata + ndata]), noisevec[ndata:ndata + ndata], scatter, n_knots, idxbad, weights[ndata:ndata + ndata]
2017-02-23 18:23:11 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:546: RuntimeWarning: invalid value encountered in multiply
2017-02-23 18:23:11 WARNING facetselfcal_facet_patch_809.executable_args:   numpy.cos(phase) * norm_factor)
2017-02-23 18:23:11 WARNING facetselfcal_facet_patch_809.executable_args: /net/para14/data1/mandal/software/factor/factor/scripts/smooth_amps_spline.py:548: RuntimeWarning: invalid value encountered in multiply
2017-02-23 18:23:11 WARNING facetselfcal_facet_patch_809.executable_args:   numpy.sin(phase) * norm_factor)
2017-02-23 18:23:12 DEBUG   facetselfcal_facet_patch_809.executable_args: smooth_amps_spline.py: Normalization-Factor is: 0.0
2017-02-23 18:23:12 DEBUG   facetselfcal_facet_patch_809.executable_args: Results for job 0 submitted by ('132.229.226.24', 52708)
2017-02-23 18:23:12 INFO    node.lofar6.strw.leidenuniv.nl.python_plugin: Total time 169.6410s; user time: 123.5809s; system time: 454.9132s
2017-02-23 18:23:12 DEBUG   node.lofar6.strw.leidenuniv.nl.python_plugin: Start time was 1487870422.8153s; end time was 1487870592.4575s
2017-02-23 18:23:12 DEBUG   facetselfcal_facet_patch_809.executable_args:

Finished preparing output MS
MSReader
  input MS:       /net/para14/data1/mandal/FACTOR_3nights/workingdir/results/facetselfcal/facet_patch_809/L274099_SB000_uv.dppp.pre-cal_12600ED58t_121MHz.pre-cal_chunk0_12600ED58t_4g.mssort_into_Groups
  band            0
  startchan:      0  (0)
  nchan:          4  (0)
  ncorrelations:  4
  nbaselines:     1891
  ntimes:         300
  time interval:  8.01112
  DATA column:    CORRECTED_DATA
  WEIGHT column:  WEIGHT_SPECTRUM
  autoweight:     false
ApplyCal correct_slow.
  parmdb:         /net/para14/data1/mandal/FACTOR_3nights/workingdir/results/facetselfcal/facet_patch_809/L340794_SB000_uv.dppp.pre-cal_126400A74t_121MHz.pre-cal_chunk12_126407AFCt_4g.smooth_amp1
  correction:     gain
    Ampl/Phase:   false
  update weights: false
  sigmaMMSE:      0
  invert:         true
  timeSlotsPerParmUpdate: 500
Averager avg.
  freqstep:       1  timestep:       15
  minpoints:      1
  minperc:        0
MSWriter msout.
  output MS:      /net/para14/data1/mandal/FACTOR_3nights/workingdir/results/facetselfcal/facet_patch_809/L274099_SB000_uv.dppp.pre-cal_12600ED58t_121MHz.pre-cal_chunk0_12600ED58t_4g.apply_amp1
  nchan:          4
  ncorrelations:  4
  nbaselines:     1891
  ntimes:         20
  time interval:  120.167
  DATA column:    DATA
  WEIGHT column:  WEIGHT_SPECTRUM
  Compressed:     no

Processing 300 time slots ...
Finishing processing ...

NaN/infinite data flagged in reader
===================================

Percentage of flagged visibilities detected per correlation:
  [0,0,0,0] out of 2269200 visibilities   [0%, 0%, 0%, 0%]
0 missing time slots were inserted

Flags set by ApplyCal correct_slow.
=======================

Percentage of visibilities flagged per baseline (antenna pair):
 ant    0    1    2    3    4    5    6    7    8    9   10   11   12   13   14
   0         0% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
   1    0%      100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
   2  100% 100%        0% 100%   0%   0%   0%   0%   0% 100%   0% 100% 100% 100%
   3  100% 100%   0%        0%   0%   0%   0% 100%   0% 100%   0%  90% 100% 100%
   4  100% 100% 100%   0%        0%  98%  28% 100% 100% 100% 100%   0% 100% 100%
   5  100% 100%   0%   0%   0%      100%   0% 100%   0% 100%   0%   0%   0% 100%
   6  100% 100%   0%   0%  98% 100%        0% 100% 100% 100% 100% 100% 100% 100%
   7  100% 100%   0%   0%  28%   0%   0%      100%   0% 100% 100% 100% 100% 100%
   8  100% 100%   0% 100% 100% 100% 100% 100%        0%  34%   0% 100% 100% 100%
   9  100% 100%   0%   0% 100%   0% 100%   0%   0%        0%   0% 100% 100% 100%
  10  100% 100% 100% 100% 100% 100% 100% 100%  34%   0%        0% 100%   0%   0%
  11  100% 100%   0%   0% 100%   0% 100% 100%   0%   0%   0%      100%   0%  64%
  12  100% 100% 100%  90%   0%   0% 100% 100% 100% 100% 100% 100%        0% 100%
  13  100% 100% 100% 100% 100%   0% 100% 100% 100% 100%   0%   0%   0%        0%
  14  100% 100% 100% 100% 100% 100% 100% 100% 100% 100%   0%  64% 100%   0%
  15  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%   0%
  16    0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%
  17    0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%   0%
  18  100% 100% 100% 100%  93% 100%  98% 100% 100% 100% 100% 100% 100% 100% 100%
  19  100% 100% 100% 100%  96% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  20  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  21  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  22  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  23  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  24  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  25  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  26  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  27  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  28  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  29  100% 100% 100% 100%  96% 100%  99% 100% 100% 100% 100% 100% 100% 100% 100%
  30  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  31  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  32  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  33  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  34  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  35  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  36  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  37  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  38  100% 100% 100% 100%  99% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  39  100% 100% 100% 100%  99% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  40  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  41  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  42  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  43  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  44  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  45  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  46  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  47  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  48  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  49  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  50  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  51  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  52  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  53  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  54  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  55  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  56  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  57  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  58  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  59  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  60  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
  61  100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100% 100%
TOTAL  95%  95%  85%  85%  90%  84%  92%  87%  91%  85%  89%  85%  92%  89%  91%
@AHorneffer
Contributor

I would prefer it if you didn't copy and paste parts of (generic-)pipeline logfiles here (they are bleeping hard to read and half of them is usually missing); please attach the entire logfile to the post instead.

Indeed the smoothing fails, which is then the reason why most of the data gets flagged. (That's what NDPPP does if it encounters a NaN as a calibration value it is asked to apply to the data.)

@rvweeren (or @darafferty): does the smooth_amps_spline.py script implicitly assume that the data was taken during only one day? E.g. by assuming that the amplitudes can be modeled over the full time-range by a low-order polynomial?

@soumyajitmandal: You could try setting spline_smooth2D to False in the Factor parset.

@soumyajitmandal
Author

soumyajitmandal commented Feb 27, 2017

I used spline_smooth2D = False but the error still exists.
parmdbplot.py *4g.merge_amp_parmdbs1 looks fine.

This time smooth_amps.py did not work either.

I ran this outside Factor:
factor/scripts/smooth_amps.py *4g.merge_amp_parmdbs1 *smooth_amp1_test
and parmdbplot.py *smooth_amp1_test shows that the amplitudes and phases are blank. The output messages were:

1. invalid value encountered in median

2. invalid value encountered in greater: high_ind = numpy.where(amp > 5.0)

I would like to point out that a few months ago I tried Factor on two different nights with 40 subbands, and we fixed the problem of having two different antenna sets, one being flagged (so essentially two different time spans): #76
So I repeated the same task on the older run, and it did NOT fail, also giving norm_factor = 1.0011506. The *smooth_amp1_test produced in that case is fine.

The only differences between the two runs are: full subbands and 3 nights of data.

@soumyajitmandal
Author

The previous run used a different version of NDPPP than the recent one, so the two parmdbs1 were created with different LOFAR software versions. I think that previously, if the interpolation did not find a value, it used to put in zeros, but now it puts in NaNs instead. Could this be the issue?

@AHorneffer
Contributor

I used spline_smooth2D = False but the error still exists.

But the error message you quoted is from smooth_amps_spline.py, i.e. the spline smoothing script, so you should at least get a different error message.

@soumyajitmandal
Author

I thought setting spline_smooth2D = False turns off the use of smooth_amps_phases_spline.py, not smooth_amps.py, right? So the error I got from the last run was from smooth_amps.py.

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

spline_smooth2D = False turns off the spline smoothing over the frequency axis; it still does a spline smooth across the time axis. (I think it always uses smooth_amps_phases_spline.py and smooth_amps.py is not used anymore, if I am correct.)

@AHorneffer
Contributor

@rvweeren: Ah, O.K.
When smoothing along the time axis, does the smooth_amps_spline.py script implicitly assume that the data was taken during only one day? (E.g. by assuming that the amplitudes can be modeled over the full time-range by a low-order polynomial?)

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

I checked the smooth_amps_spline.py script and it creates a time axis:
times = numpy.copy(sorted( parms[key_names[0]]['times']))

Maybe it fails because of that and it cannot handle a very large gap (although, looking at the code, the spline does not directly use that time axis in the fit). The easiest way to debug this is to take the parmdb, run smooth_amps_spline.py on it manually, and check where in the script it fails.

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

Hmm, update/correction: apparently it does use smooth_amps.py if spline_smooth2d=False:

if self.parset['calibration_specific']['spline_smooth2d']:
    smooth_amps_task = 'smooth_amps_spline'
else:
    smooth_amps_task = 'smooth_amps'

(from facet_ops.py)

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

It might be that the script is failing because there are NaN input values; otherwise I cannot see why

high_ind = numpy.where(amp > 5.0)

could give an error message. I guess you need to open the scripts, do some debugging, and figure out precisely where it goes wrong in smooth_amps_spline.py and smooth_amps.py.
Give it a try and see how far you can get; the scripts are not very complicated. (If you still remain stuck, provide the parmdb.)
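Just to illustrate what NaNs do to this kind of selection (a minimal numpy sketch, not taken from the script): a NaN compares as False against any threshold, and functions like numpy.median propagate it, which is consistent with the warnings in the log:

```python
import numpy as np

amp = np.array([1.0, np.nan, 7.0, 2.0])

# A NaN compares as False against any threshold, so it is silently dropped
# from the selection (older numpy versions also emit the
# "invalid value encountered in greater" RuntimeWarning seen in the log).
high_ind = np.where(amp > 5.0)
print(high_ind[0])          # only the index of 7.0

# The plain median propagates the NaN...
print(np.median(amp))       # nan
# ...while the NaN-aware variant ignores it.
print(np.nanmedian(amp))    # 2.0
```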

@soumyajitmandal
Author

Yeah, indeed the error was with smooth_amps.py this time.

So I ran smooth_amps.py on the merged parmdb: smooth_amps.py *4g.merge_amp_parmdbs1 *smooth_amp1_test

I printed 'ampl' after this line:
ampl_tot_copy = numpy.copy(ampl)

The values were NaNs, whereas in my successful run a few months earlier, doing the same thing gave me zeros.

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

You need to get back further; ampl_tot_copy = numpy.copy(ampl) is already too deep into the script.

The questions for you to answer are: (1) does the input parmdb contain NaNs, and (2) is that the reason why it fails (because smooth_amps.py is not NaN-proof)?

Check channel_parms_real and channel_parms_imag on lines 125/126.
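A quick way to scan for NaNs, assuming the nested dict-of-arrays layout that the parmdb's getValuesGrid('*') call returns (the parms dict below is a mock built for illustration, not real solutions):

```python
import numpy as np

# Mock of the parms dict shape: one entry per parameter name, each holding
# a 2-D (time x freq) array of solution values.
parms = {
    "Gain:0:0:Real:CS001HBA0": {"values": np.array([[1.0, np.nan], [0.9, 1.1]])},
    "Gain:0:0:Imag:CS001HBA0": {"values": np.array([[0.1, 0.2], [0.0, 0.1]])},
}

# Report, per parameter, how many entries are NaN.
for name, entry in sorted(parms.items()):
    values = entry["values"]
    n_nan = int(np.isnan(values).sum())
    if n_nan > 0:
        print("%s: %d of %d values are NaN" % (name, n_nan, values.size))
```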

@soumyajitmandal
Author

soumyajitmandal commented Mar 1, 2017

Yes, channel_parms_real and channel_parms_imag also have NaN values, so the input parmdb has NaN values, whereas earlier it used to have zeros.

@rvweeren
Collaborator

rvweeren commented Mar 1, 2017

Ok, so it looks like smooth_amps.py is simply not NaN proof.

@soumyajitmandal
Author

Hmm, okay. Is it a good idea to put zeros in place of the NaNs, or is that not a good solution?

@rvweeren
Collaborator

rvweeren commented Mar 3, 2017

You should try to edit smooth_amps.py so that it is NaN-proof (with minimal other changes). Without having looked at it in detail, I think that should not be very difficult to do.

(I probably do not have time to look at it myself over the next two weeks; after that I might have time to help with it and also to check smooth_amps_spline.py, because in the end it is preferable to use spline_smooth2d, as it is more capable of detecting amplitude outliers.)
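One minimal way to make such a script NaN-proof, sketched with plain numpy (the function and variable names here are illustrative, not the script's own): mask the invalid entries, fill them with the median of the valid ones, and keep a record of which slots were bad:

```python
import numpy as np

def nanproof_smooth_input(amp):
    """Return a copy of `amp` with NaN/inf entries replaced by the median
    of the valid entries, plus a boolean mask of which slots were bad."""
    amp = np.asarray(amp, dtype=float)
    bad = ~np.isfinite(amp)
    if bad.all():
        # Fully flagged time series: nothing to recover, fall back to ones.
        return np.ones_like(amp), bad
    filled = amp.copy()
    filled[bad] = np.median(amp[~bad])
    return filled, bad

amp = np.array([1.0, np.nan, 1.2, np.inf, 0.8])
filled, bad = nanproof_smooth_input(amp)
print(filled)  # the NaN/inf slots now hold the median of the good values, 1.0
```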

@AHorneffer
Contributor

@soumyajitmandal: Can you put a parmDB with NaNs somewhere where I can find it, so I can test the code?

@ALL: What should we do with the flagged data? Replacing the amplitudes with the median value is straightforward, but what should we do with the phases? Setting them to zero would be the simplest. Finding a useful median for phases is not only more complicated, but I also don't know whether it is a good idea.
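On the phase question: a plain median is indeed ill-defined for wrapped angles, but a circular mean (the angle of the averaged unit phasors) is a cheap alternative to setting them to zero. A sketch, not a recommendation:

```python
import numpy as np

def circular_mean(phases):
    """Mean direction of wrapped angles: average the unit phasors and take
    the angle, which is insensitive to the 2*pi wrap."""
    return np.angle(np.mean(np.exp(1j * np.asarray(phases))))

# Angles clustered around +/- pi: a plain mean or median is meaningless here,
# while the circular mean lands at pi as expected.
phases = np.array([3.1, -3.1, 3.0, -3.0])
print(circular_mean(phases))
```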

@soumyajitmandal
Author

I did a test putting zeros instead of the NaNs, but the normalisation gets messed up in that process. Might using a masked array be useful instead?
channel_parms_real = numpy.ma.masked_invalid(channel_parms_real)
But this way the median function might not work.

Attached is the parmdb:
lockman_amp_parmdbs1.zip
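For what it's worth, numpy's masked-array module does ship a median that simply skips masked entries, so the masked-array route need not break the median step. A quick check (plain numpy, outside the script):

```python
import numpy as np

channel_parms_real = np.array([1.0, np.nan, 0.9, 1.1, np.nan])
masked = np.ma.masked_invalid(channel_parms_real)

# np.ma.median ignores the masked (NaN) entries instead of propagating them.
print(np.ma.median(masked))           # 1.0
# .compressed() drops them entirely, which gives the same median here.
print(np.median(masked.compressed()))
```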

@soumyajitmandal
Author

Including the masked array in a different place seems to be working so far in my case. I set spline_smooth2D = False, which means it is using smooth_amps.py.

I changed line 146 to:
amp = numpy.ma.masked_invalid(numpy.copy(numpy.sqrt(real2 + imag2))).compressed()
Previously, while it was trying to create image31, it was failing since everything was flagged due to the NaN entries and no norm_factor was found. Now everything up to image42 has been created, and the parmdbs look fine as well.

@AHorneffer
Contributor

Well, having had a look at the parmDB you attached here, I think it would be important to find out why you have so many NaNs in the parmDB. Did you flag large parts of the data? (And why would NDPPP create parmDB entries with NaNs in that case, instead of not creating the entries at all?) Or are there parts of the data where NDPPP couldn't get a solution even though there was data?

@soumyajitmandal
Author

Since there is a time gap between the different nights, I thought that was producing the NaNs. In general, when I processed the different nights' data separately, I did not see the NaN issue.

@AHorneffer
Contributor

Well, the smoothing is done on single time series (i.e. separately per antenna, polarization, and channel), and several of these time series are fully flagged.

@AHorneffer
Contributor

Btw., here is a version of the script that not only works with NaNs, but also doesn't produce the RuntimeWarnings:
smooth_amps.py.txt

@soumyajitmandal
Author

soumyajitmandal commented Mar 8, 2017

I will try this version, thanks a lot. I was trying the temporary fix that I wrote in my previous comment (which I think you also put into the modified text file) and got an error. I reproduced the error outside the pipeline using convert_solutions_to_gain.py:

convert_solutions_to_gain.py *.pre-cal_chunk12_126407AFCt_4g.merge_phase_parmdbs *.pre-cal_chunk12_126407AFCt_4g.smooth_amp2 gain_test

Traceback (most recent call last):
  File "/net/para14/data1/mandal/software/factor_normalize/factor/scripts/convert_solutions_to_gain.py", line 165, in <module>
    main(args.fast_selfcal_parmdb, args.slow_selfcal_parmdb, args.output_file, preapply_parmdb=args.preapply_parmdb)
  File "/net/para14/data1/mandal/software/factor_normalize/factor/scripts/convert_solutions_to_gain.py", line 96, in main
    fast_timewidths, asStartEnd=False)
  File "/net/lofar1/data1/oonk/rh7_lof_feb2017_2_19_0/lofar/lib64/python2.7/site-packages/lofar/parmdb/__init__.py", line 147, in getValues
    includeDefaults)
Boost.Python.ArgumentError: Python argument types in
    ParmDB._getValues(parmdb, str, numpy.ndarray, numpy.ndarray, numpy.ndarray, numpy.ndarray, bool, bool)
did not match C++ signature:
    _getValues(LOFAR::BBS::PyParmDB {lvalue}, std::string parmnamepattern, double sfreq, double efreq, double freqstep, double stime, double etime, double timestep, bool asStartEnd=True, bool includeDefaults=False)
    _getValues(LOFAR::BBS::PyParmDB {lvalue}, std::string parmnamepattern, double sfreq=-1e+30, double efreq=1e+30, double stime=-1e+30, double etime=1e+30, bool asStartEnd=True, bool includeDefaults=False)

Has anyone seen this earlier?

@darafferty
Collaborator

I found and fixed a problem in convert_solutions_to_gain.py with parmdbs that have large gaps. @soumyajitmandal, can you try your run again? (The fix is only available on the latest master, so if you want to use it with an earlier version of Factor, you'll need to copy the new version into your Factor installation by hand.)

@soumyajitmandal
Author

Thanks, David! I tried this code last week after we chatted during the busy week, so I do have parmdbs created with the latest version. I will just do a git pull and rerun.

@soumyajitmandal
Author

Hi David,

it seems like convert_solutions_to_gain.py works now. Probably we need the same kind of fix in reset_amps.py? That's where it's failing now. I tried to run it outside Factor and got a similar error:

reset_amps.py L340794_SB000_uv.dppp.pre-cal_126400A74t_121MHz.pre-cal_chunk12_126407AFCt_4g.convert_merged_selfcal_parmdbs test_parm
Traceback (most recent call last):
  File "./reset_amps.py", line 77, in <module>
    main(args.instrument_name, args.instrument_name_reset)
  File "./reset_amps.py", line 32, in main
    freqs = parms['Gain:1:1:Ampl:{s}'.format(s=antenna_list[0])]['freqs']
IndexError: list index out of range

@darafferty
Collaborator

I think this problem might be fixed by commit 4dc0157. To test it, update Factor, reset the pipeline state so that it repeats the convert_merged_selfcal_parmdbs step, then rerun.

@darafferty
Collaborator

OK, I added a check for missing stations to reset_amps.py, so update and give it another try.

@botteon

botteon commented May 9, 2017

Hi David, after yesterday's update on CEP3 I get this error:

2017-05-09 11:23:18 ERROR   facetselfcal_facet_patch_574: Failed pipeline run: facet_patch_574
2017-05-09 11:23:18 ERROR   facetselfcal_facet_patch_574: Detailed exception information:
2017-05-09 11:23:18 ERROR   facetselfcal_facet_patch_574: <class 'lofarpipe.support.lofarexceptions.PipelineRecipeFailed'>
2017-05-09 11:23:18 ERROR   facetselfcal_facet_patch_574: convert_solutions_to_gain failed

I think it is related to the new commit.

@botteon

botteon commented May 9, 2017

The error seems to be:

2017-05-09 11:23:16 ERROR   node.lof004.python_plugin: local variable 'gaps_ind' referenced before assignment

@darafferty
Collaborator

Thanks -- the 'gaps_ind' problem should be fixed now (and I updated it on CEP3).

@soumyajitmandal
Author

Okay, now I think it has passed the reset_amps.py stage. By the way, is there a plotting issue with parmdbplot.py? I was checking parmdbplot.py *4g.create_preapply_parmdb and the 'polar' plot shows the amplitude to be zero everywhere; the real and imaginary parts look fine.

Anyway, now it's failing at the plot-solutions step, apparently due to an array-size issue. Here it is:
plot_selfcal_solutions.py -p L340794_SB000_uv.dppp.pre-cal_126400A74t_121MHz.pre-cal_chunk12_126407AFCt_4g.merge_selfcal_parmdbs hola
/software/rhel7/lib64/python2.7/site-packages/numpy/ma/core.py:852: RuntimeWarning: invalid value encountered in greater_equal
  return umath.absolute(a) * self.tolerance >= umath.absolute(b)
./plot_selfcal_solutions.py:43: RuntimeWarning: invalid value encountered in less
  out[out < -np.pi] += 2.0 * np.pi
./plot_selfcal_solutions.py:44: RuntimeWarning: invalid value encountered in greater
  out[out > np.pi] -= 2.0 * np.pi
Traceback (most recent call last):
  File "./plot_selfcal_solutions.py", line 662, in <module>
    refstation=args.refstation, fourpol=args.fourpol)
  File "./plot_selfcal_solutions.py", line 620, in main
    solplot_phase(parmdb, imageroot, refstation, plot_international=plot_international, fourpol=fourpol)
  File "./plot_selfcal_solutions.py", line 424, in solplot_phase
    axsp[istat][0].plot(times, normalize(phase00-phase00_ref_chan), color='b', marker=fmt, ls=ls, label='Gain:0:0:Phase', mec='b')
  File "/software/rhel7/lib64/python2.7/site-packages/numpy/ma/core.py", line 3971, in __sub__
    return subtract(self, other)
  File "/software/rhel7/lib64/python2.7/site-packages/numpy/ma/core.py", line 1006, in __call__
    result = self.f(da, db, *args, **kwargs)
ValueError: operands could not be broadcast together with shapes (260,) (390,)

@darafferty
Collaborator

darafferty commented May 11, 2017

I've modified the plot_selfcal_solutions.py script to work with multiple nights. It should now produce one plot per night (instead of all the nights in a single plot, which was unreadable).

@soumyajitmandal
Author

Okay. It will probably work, but it says: global name 'fp' is not defined

darafferty added a commit that referenced this issue May 11, 2017
@darafferty
Collaborator

Oops -- copy and paste error. Try it now.

@soumyajitmandal
Author

The plotting step passed and the plots look quite fine. Now it's preparing the imaging chunk dataset. I will keep you updated. :)

@soumyajitmandal
Author

Good news: the code seems bug-free for multiple nights now! :)

Some issues (not related to the code, though, I think):
The image looks bad after combining three different nights, while the individual images look quite similar. I attach the calibrator image from one of the nights and the same calibrator from multiple nights:
1 night:
l299961_sb000_uv dppp pre-cal_12615e3act_121mhz pre-cal_chunk12_126165434t_4g wsclean_image42_iter9-mfs-image

3 nights:
l340794_sb000_uv dppp pre-cal_126400a74t_121mhz pre-cal_chunk12_126407afct_0g wsclean_image42_iter1-mfs-image

It looks like the amplitude and TEC had problems (?). The phases are fine.

Amplitude (night1):
l340794_sb000_uv dppp pre-cal_126400a74t_121mhz pre-cal_chunk12_126407afct_0g make_selfcal_plots_amp_channel0_period0

Amplitude(night2):
l340794_sb000_uv dppp pre-cal_126400a74t_121mhz pre-cal_chunk12_126407afct_0g make_selfcal_plots_amp_channel0_period1

TECscalarphase (night1):
l340794_sb000_uv dppp pre-cal_126400a74t_121mhz pre-cal_chunk12_126407afct_0g make_selfcal_plots_tec_scalarphase_channel0_period0

TECscalarphase (night2):
l340794_sb000_uv dppp pre-cal_126400a74t_121mhz pre-cal_chunk12_126407afct_0g make_selfcal_plots_tec_scalarphase_channel0_period1

How are the amplitudes applied when we merge different observations? I mean, do they get averaged? Also, do you think it's worth trying to flag the noisy TEC solutions?

@darafferty
Collaborator

darafferty commented May 15, 2017

Factor doesn't do anything special with data from multiple nights -- they're handled just like data from a single night. So it will smooth the amplitudes and normalize them in the same way (a single normalization is done across all three nights). No averaging is done.

It's probably good to flag the periods during night 1 when the solutions are noisy (between hours 6 and 7, and after hour 8). I'm not sure whether they're the cause of the poor results, though.
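Flagging those noisy periods could be as simple as masking the solutions inside the offending time windows before they are applied. A sketch with made-up arrays (the times and window boundaries here are illustrative, in hours since the start of night 1):

```python
import numpy as np

# Illustrative solution arrays: times in hours since the start of night 1.
times = np.linspace(0.0, 10.0, 21)
amps = np.ones_like(times)

# Windows to flag: hours 6-7 and everything after hour 8.
noisy = ((times >= 6.0) & (times <= 7.0)) | (times > 8.0)

# Represent flagged solutions as NaN so downstream tools treat them as bad.
amps_flagged = amps.copy()
amps_flagged[noisy] = np.nan
print(int(np.isnan(amps_flagged).sum()), "of", times.size, "solutions flagged")
```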

@twshimwell

It's very odd that the background noise looks essentially the same even though you have 3 times the data. I'd perhaps expect the artefacts to stay similar, but the noise should really go down a fair bit.

I guess the "sources" in the 3-day image that pop up near your bright source are not real? Did you check the masks throughout the calibration of the 3-day run?

@soumyajitmandal
Author

@darafferty I am also imaging them separately now (i.e. each facet has gone through Factor; I used a single model, added it to each night's dataset, and ran gaincal/applycal. The imaging is running now; by tomorrow I hope to see the result).

@darafferty @twshimwell Let's wait till the facet is finished. I checked the full image in the facetselfcal directory (which only has 1/6th of the bandwidth -- is there an option to include the full band? Factor does not have that setting anymore; it's 6 by default). The facetimage run (going now) should have the whole bandwidth, so I expect the background noise to be a bit better than what we are seeing now.

@AHorneffer
Contributor

@soumyajitmandal I was wondering: did you make sure that the same "InitSubtract" model was subtracted from all three nights? E.g. by running Initial-Subtract on all three nights together, or by subtracting the model of one night from the other two.

If different models have been subtracted from the different nights, then Factor will screw up. (It will just assume that the same model was subtracted from all the data and thus treat two of the nights wrongly.)

@soumyajitmandal
Author

Each night should be init-subtracted separately right?

Let me explain what I did: I processed three different nights independently up to the init-subtract step. So after the init-subtract step, for each night I have low2-model.merge skymodels (24, 24, and 23, i.e. 71 in total). In my Factor run, the msfiles directory contains all 71 *.ms files and these low2-model.merge files.

@AHorneffer
Contributor

Each night should be init-subtracted separately right?

No! (Feel free to add a few more exclamation marks, blinking effects or so.)

Factor will use one model for all files in one frequency band. If you actually give it files for the same frequency band in which different models have been subtracted, then it will screw up. The way you started it, it will randomly(*) choose one of the three skymodels and use that for adding back the sources to the data from all three nights. So the two nights for which another model has been subtracted will get treated wrong.

My suggestion to fix that is to choose the model of one of the three nights, and subtract that from the other two nights.

(*) Well, not actually randomly, but it will be undefined behavior.
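To illustrate why only one skymodel per band survives, here is a toy sketch of the failure mode (not Factor's actual code; the file names are hypothetical): if skymodels are indexed by band frequency, same-frequency files from different nights collide and only one mapping remains.

```python
files = [
    ("night1_121MHz.ms", "night1_121MHz.skymodel"),
    ("night2_121MHz.ms", "night2_121MHz.skymodel"),
    ("night3_121MHz.ms", "night3_121MHz.skymodel"),
]

model_for_band = {}
for ms, skymodel in files:
    band = ms.split("_")[1].split(".")[0]  # "121MHz" for all three nights
    model_for_band[band] = skymodel        # later entries silently overwrite

# All three MSs now map to one model; the other two nights' models are
# ignored, so those nights get the "wrong" sources added back.
print(model_for_band)  # {'121MHz': 'night3_121MHz.skymodel'}
```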

@twshimwell

Is this the behaviour even if the models are explicitly specified in the Factor parset?

@AHorneffer
Contributor

@twshimwell Yes. The internal data structure and the layout of the pipeline parsets only use one skymodel per frequency-band.

@soumyajitmandal
Author

@AHorneffer hmm. I am now rerunning with datasets from 2 different nights instead of 3 (should be a bit quicker). This time the msfiles directory contains measurement sets from both nights and low2-model.merge files from only one night. NOT specifying anything in the parset should not be a problem, right?

@AHorneffer
Contributor

@soumyajitmandal Was the same skymodel subtracted from the measurement sets of both nights?

Not explicitly specifying the model for each band in the parset is only a problem if Factor cannot find any skymodel for a band. But then it will fail during the setup phase.

@rvweeren
Collaborator

As I said, I ran init-subtract independently. So for each night & band, low2-model.merge files were created. But while running factor, I am using only one of these skymodel sets from one night. In Factor, these are the skymodels that will be used to subtract sources (?).

That approach does not make sense (if I understand correctly what you are attempting), because in the init-subtract you then subtracted a different skymodel (for one of the datasets) than what will be added back in Factor.

@duyhoang-astro

@AHorneffer: Can the init-subtract step produce one single skymodel for each band when the input MSs have different names for similar bands? Or will we even need to combine the MSs for the different nights in the pre-facet phase (i.e. concat subbands, direction-independent calibration steps, etc.)?

@AHorneffer
Contributor

@duyhoang2014 I don't really understand your question!

Some general answers:

  • Never manually concatenate MSs when you want to run prefactor and Factor. (Chances are that you will get it wrong.)
  • prefactor doesn't use the file name for sorting MSs (anymore; version 1 did that); instead it looks into the MSs to figure out which frequency and time each MS covers.
  • You should never have "similar" bands! Either the frequencies in the MSs are identical, or different. Overlapping but not identical frequencies in different MSs will cause problems.
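A quick way to sanity-check the "identical or different" rule before running the pipelines is to group the MSs by their reference frequency: if the nights truly match, every band should contain one MS per night. A sketch with made-up names and frequencies (in a real check you would read the frequency from each MS's SPECTRAL_WINDOW table, e.g. with python-casacore):

```python
from collections import defaultdict

# Hypothetical MS names and reference frequencies (Hz)
ms_freqs = {
    "night1_SB000.ms": 121.0e6,
    "night2_SB000.ms": 121.0e6,  # identical -> same band, OK
    "night1_SB001.ms": 123.0e6,
    "night2_SB001.ms": 123.1e6,  # overlapping but not identical -> trouble
}

bands = defaultdict(list)
for ms, freq in ms_freqs.items():
    bands[freq].append(ms)

# Bands with a single member hint at a frequency mismatch between nights
suspects = {f: names for f, names in bands.items() if len(names) == 1}
```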

@duyhoang-astro

duyhoang-astro commented May 18, 2017

@AHorneffer : sorry for not being clear. I meant that after the prefactor runs (to do amp. transfer, clock-offset correction, direction-independent phase calibration, etc.) on different nights separately, we will have concatenated MSs (of e.g. 10 subbands each, i.e. a band) for these separate nights. These MSs for each night have identical central frequencies (e.g. at 121 MHz, 123 MHz, etc.), and they also have separate instrument tables (i.e. instrument_directionindependent). When prefactor does the initial subtraction, I can use all MSs of the nights as the input MSs (e.g. put all of the MSs into a single directory). My question is: can prefactor produce a single skymodel for all the nights at each central frequency (e.g. 121 MHz, 123 MHz, etc.)?

@AHorneffer
Contributor

When prefactor does the initial subtraction, I can use all MSs of the nights as the input MSs (e.g. put all of the MSs into a single directory).

Yes, you can. That is indeed the way I intended the pipelines to be used when I wrote them, and for interleaved observations it is also the way they should be used. If you have several "full" nights, then this has the drawback that the imaging takes even longer, so I am unsure what to recommend: imaging all nights together, or imaging only one night and then subtracting that skymodel from the other nights.

My question is: can prefactor produce a single skymodel for all the nights at each central frequency (e.g. 121 MHz, 123 MHz, etc.)?

If you want to run Factor on the data, then you need one(!) skymodel that has been subtracted from all time-chunks (e.g. nights). I've been repeating this over and over again in the last two days, so:
@soumyajitmandal and @duyhoang2014 Can you please add that to the documentation where you would have read it and in a way that you would have understood it! Feel free to just edit e.g. the prefactor wiki and don't worry too much about getting it wrong, I can always correct it if I think it is wrong.

Do I need to repeat that if you run Initial-Subtract (any of the three versions) on multiple nights, then you get one skymodel for all nights together? There is no pipeline/script yet to subtract a given skymodel (or a set of given skymodels, one for each frequency band) from the MSs of another night, but I'm sure one of you is willing to write one and add it to prefactor. 😁
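As a starting point for such a script, the subtraction itself could be a single NDPPP predict step with operation=subtract. This is a hedged, untested sketch: the file and column names are hypothetical, the skymodel must first be converted with makesourcedb, and you should verify that your NDPPP version supports the predict step's `operation` parameter:

```
# subtract_night1_model.parset -- hedged, untested sketch
# beforehand: makesourcedb in=low2-model.merge out=night1.sourcedb
msin = night2_SB000.ms             # hypothetical MS name
msin.datacolumn = CORRECTED_DATA   # column depends on your processing
msout = .
steps = [sub]
sub.type = predict
sub.sourcedb = night1.sourcedb
sub.operation = subtract
```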

@duyhoang-astro

@AHorneffer: Many thanks!

Yes, you can. That is indeed the way I intended the pipelines to be used when I wrote them, and for interleaved observations it is also the way they should be used. If you have several "full" nights, then this has the drawback that the imaging takes even longer, so I am unsure what to recommend: imaging all nights together, or imaging only one night and then subtracting that skymodel from the other nights.

Ok. So prefactor (e.g. Initial-Subtract) does work with interleaved data sets.

If you want to run Factor on the data, then you need one(!) skymodel that has been subtracted from all time-chunks (e.g. nights).

Yes, one skymodel (created from all time-chunks) is used for each MS (each band) to run Factor, which is what I usually do but forgot when asking. Sorry for asking that again.

There is no pipeline / script yet to subtract a given skymodel (or a set of given skymodels, one for each frequency band) from the MSs of another night, but I'm sure one of you is willing to write one and add it to prefactor.

Currently I have two 4-hour observations of the same field, which are not too long. I will try to combine them both in the Initial-Subtract step. Let's see how things go...
