Diff for "PJJung/CAGMonEtude"

Differences between revisions 15 and 18 (spanning 3 versions)

 ,-----.  ,---.   ,----.   ,--.   ,--.                            ,--.             ,--.        
'  .--./ /  O  \ '  .-./   |   `.'   | ,---. ,--,--,      ,---. ,-'  '-.,--.,--. ,-|  | ,---.  
|  |    |  .-.  ||  | .---.|  |'.'|  || .-. ||      \    | .-. :'-.  .-'|  ||  |' .-. || .-. : 
'  '--'\|  | |  |'  '--'  ||  |   |  |' '-' '|  ||  |    \   --.  |  |  '  ''  '\ `-' |\   --. 
 `-----'`--' `--' `------' `--'   `--' `---' `--''--'     `----'  `--'   `----'  `---'  `----'

Description

The CAGMon etude is a study version of CAGMon that evaluates the dependence between the primary and auxiliary channels.

Project goal

The goal of this project is to find a systematic way of identifying the abnormal glitches in the gravitational-wave data using various methods of correlation analysis. Usually, the community such as LIGO, Virgo, and KAGRA uses a conventional way of finding glitches in auxiliary channels of the detector - Klein-Welle, Omicron, Ordered Veto Lists, etc. However, some different ways can be possible to find and monitor them in a (quasi-) realtime. Also, the method can point out which channel is responsible for the found glitch. In this project, we study its possible to apply three different correlation methods - maximal information coefficient, Pearson's correlation coefficient, and Kendall's tau coefficient - in the gravitational wave data from the KAGRA detector.

Participants

John.J Oh (NIMS)
Young-Min Kim (UNIST)
Pil-Jong Jung (NIMS)

Methods and Frameworks

Maximal Information Coefficient (MIC)

the Maximal Information coefficient(MIC) of a set D of two-variable data with sample size n and grid less than B(n) is given by

\[ MIC(D)=\underset{xy<B(n)}{\max}{\left\{ \frac{I^{*}(D,x,y)}{\log \min \left\{x,y \right\}} \right \}} \],

where \[\omega(1)<B(n)\le O(n^{1-\epsilon}) \] for some \[ 0<\epsilon<1 \]

Pearson's Correlation Coefficient (PCC)

Pearson Correlation Coefficient(PCC) is a statistic that explains the amount of variance accounted for in the relationship between two (or more) variables by \[ R=} \],

where \[ \overline{X} \] and \[ \overline{Y} \] are the mean of X and Y, respectively

Kendall's tau Coefficient

Kendall’s tau with a random samples n of observations from two variables measures the strength of the relationship between two ordinal level variables by

\[ \tau =\frac{c-d} \],

where c is the number of concordant pairs, and d is the number of discordant pairs

Flow chart

Code development

GitHub

TBA

Code version

CAGMon Etude Alpha
- for the basic test and evaluation of the LASSO regression method developed by LIGO
- reproduced original CAGMon methods and idea
CAGMon Etude Beta
- added coefficient trend plots with LASSO beta, coherence, MIC, PCC, and Kendall's tau
CAGMon Etude Delta
- fixed a critical problem that sucked enormous memory when it used the matplotlib module
CAGMon Etude Eta
- fixed memory issues
- fixed minor bugs
- added the range limitation of stride
CAGMon Etude Flat (latest version)
- fixed minor issues and optimized scripts
- added the script of HTML summary page
- added coefficient distribution plots

Structure of scripts

Agrement.py
- the script gathered functions the medel required
Melody.py
- the script to calcutate each coefficient and to save trend data as csv
Conchord.py
- the script to make plots, such as coefficient trend, coefficient distribution trend, time-series, and scatter plots
Echo.py
- the script to save the result as HTML web page
CAGMonEtudeFlat.py
- the script to run each script

User guide

How to use CAGMon

Needs of code development

Daily running on KAGRA

Exemplary results

1. Earthquake effects during O3GK

Datetime: 19 April 2020 20:39 UTC
Purpose
- Test to run CAGMon algorithm with a remarkable event
- To figure out the cause of lock-loss in KAGRA
Results

2. Skim through all obs-segments of O3GK

Purpose
- Test for calculation time and required resources with all observation segments during O3GK
- To figure out trigger events or abnormal behaviors
Results

-  ⇤ ← Revision 15 as of 2021-01-21 15:16:51 → 
  Size: 4436
  Editor: PJJung
  Comment:
+   ← Revision 18 as of 2021-01-22 11:44:00 → ⇥
  Size: 5593
  Editor: PJJung
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 1:
-= CAGMon etude =
+{{{                                                                                           
 ,-----.  ,---.   ,----.   ,--.   ,--.                            ,--.             ,--.        
'  .--./ /  O  \ '  .-./   |   `.'   | ,---. ,--,--,      ,---. ,-'  '-.,--.,--. ,-|  | ,---.  
|  |    |  .-.  ||  | .---.|  |'.'|  || .-. ||      \    | .-. :'-.  .-'|  ||  |' .-. || .-. : 
'  '--'\|  | |  |'  '--'  ||  |   |  |' '-' '|  ||  |    \   --.  |  |  '  ''  '\ `-' |\   --. 
 `-----'`--' `--' `------' `--'   `--' `---' `--''--'     `----'  `--'   `----'  `---'  `----'                                                                                             
}}}
-Line 49:
+Line 57:
+==== Flow chart ====
-Line 53:
+Line 64:
-TBA
+[[TBA]]
-Line 84:
+Line 95:
+==== User guide ====
 * [[How to use CAGMon]]

==== Needs of code development ====
 * Daily running on KAGRA
-Line 96:
+Line 114:
+. Skim through all obs-segments of O3GK 
 * Purpose
  * Test for calculation time and required resources with all observation segments during O3GK
  * To figure out trigger events or abnormal behaviors
 * Results
-Line 104:
+Line 129:
+[[https://science.sciencemag.org/content/334/6062/1518 | Science.1518; Detecting Novel Associations in Large Data Sets]]