From 3614221ae306a1a5b2f62dd8423075b6561eae19 Mon Sep 17 00:00:00 2001
From: Evgeny Blokhin <eb@tilde.pro>
Date: Tue, 5 Jul 2022 17:05:14 +0200
Subject: [PATCH 1/3] Polish text

---
 index.html | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/index.html b/index.html
index af69dd7..24da357 100755
--- a/index.html
+++ b/index.html
@@ -920,7 +920,7 @@ <h3 id="JSON-schemata">&sect;1.4. JSON schemata of the MPDS entries</h3>
 
             <h3 id="Example-scripts">&sect;1.5. Example scripts</h3>
 
-            <p class="view">Below are simple examples for the MPDS data retrieval in two programming languages: <strong>Python</strong> and <strong>JavaScript</strong>. More languages can be added by request. Note, that here the Python example requires an external package <span class="t">httplib2</span>. We provide more convenient <a href="https://github.com/mpds-io/python-api-client">Python client library</a>, so these examples here serve demonstration purposes only.</p>
+            <p class="view">Below are simple examples of MPDS data retrieval in two programming languages: <strong>Python</strong> and <strong>JavaScript</strong>. More languages can be added on request. Note that the Python example here requires the external package <span class="t">httplib2</span>. We provide a more convenient <a href="https://github.com/mpds-io/python-api-client">Python client library</a> (see the next sections), so these examples serve demonstration purposes only.</p>
 
             <div id="example_python" class="view">
             <pre id="L5"><code>#!/usr/bin/env python
@@ -1033,7 +1033,7 @@ <h3 id="Overview-2">&sect;2.1. Overview</h3>
 
             <h3 id="Client-library">&sect;2.2. Client library</h3>
 
-            <p class="view">As mentioned, any programming language, able to execute HTTP requests and handle the JSON output, can be employed. However, one of the most frequently used languages in data processing is Python. Therefore we provide a <a href="https://github.com/mpds-io/python-api-client">client library</a> for Python versions 2.7 and 3.6. This library takes care of many aspects of the MPDS API, such as pagination, error handling, validation, proper data extraction and more. We encourage our users to adopt this library for their needs. It is installed as any other Python library:</p>
+            <p class="view">As mentioned, any programming language able to execute HTTP requests and handle JSON output can be employed. However, Python is one of the most frequently used languages in data processing. Therefore we provide a <a href="https://github.com/mpds-io/python-api-client">Python library</a> that takes care of many aspects of the MPDS API, such as pagination, error handling, validation, proper data extraction, and more. We encourage our users to adopt this library for their needs. It is installed like any other Python library:</p>
 
             <div class="blackbg">pip install mpds_client</div>
 
@@ -1339,7 +1339,7 @@ <h3 id="Visualizations">&sect;2.6. Visualizations</h3>
 
             <p class="view">Although the reader is encouraged to visualize the data using his habitual tools, we provide a set of helper utilities. Using a simple exporting toolbox in our Python client library each of the exercises considered above may output two files for the further plotting: <strong>CSV</strong> and <strong>JSON</strong>.</p>
 
-            <p class="view">By default these two files are written in a system-wide temporary directory <span class="t">/tmp</span> (subdirectory <span class="t">_MPDS</span>). <strong>CSV</strong> is commonly used in the electronic sheets (such as OpenOffice Calc or Excel), and <strong>JSON</strong> has a custom self-explanatory layout suitable for <a href="/visavis">Vis-&agrave;-vis web-viewer</a>. This is quite unsophisticated browser-based JavaScript application, heavily used inside the MPDS GUI. It employs <a href="https://plot.ly">Plotly</a> and <a href="https://d3js.org">D3</a> visualization libraries. <strong>JSON</strong> produced with the exporting toolbox may be simply drag-n-dropped in the browser window with the loaded <a href="/visavis">Vis-&agrave;-vis</a>.</p>
+            <p class="view">By default these two files are written to the system-wide temporary directory <span class="t">/tmp</span> (subdirectory <span class="t">_MPDS</span>). <strong>CSV</strong> is commonly used in spreadsheet applications (such as OpenOffice Calc), and <strong>JSON</strong> has a custom self-explanatory layout suitable for the <a href="/visavis">Vis-&agrave;-vis web-viewer</a>. This is a fairly simple browser-based JavaScript application, heavily used inside the MPDS GUI. It employs the <a href="https://plot.ly">Plotly</a> and <a href="https://d3js.org">D3</a> visualization libraries. A <strong>JSON</strong> file produced with the exporting toolbox can simply be dragged and dropped into the browser window with <a href="/visavis">Vis-&agrave;-vis</a> loaded.</p>
 
             <p style="background:#f4fbff;">We thank the reader for the time and interest! Any questions or feedback is <a href="mailto:feedback@tilde.pro">very welcomed and greatly appreciated</a>.</p>
         </div>

From efebb1ca1e1a82b7c3437fdbf96d24528d9dc589 Mon Sep 17 00:00:00 2001
From: Evgeny Blokhin <eb@tilde.pro>
Date: Thu, 21 Jul 2022 14:00:28 +0200
Subject: [PATCH 2/3] Update phase ID wording

---
 index.html | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/index.html b/index.html
index 24da357..fce66f9 100755
--- a/index.html
+++ b/index.html
@@ -632,7 +632,7 @@ <h3 id="Overview-1">&sect;1.1. MPDS data structure</h3>
 
             <p class="view">The standard unit of the MPDS data is an <strong>entry</strong>. All the MPDS entries are subdivided into three kinds: <strong>crystalline structures</strong>, <strong>physical properties</strong>, and <strong>phase diagrams</strong>. They are called S-, P- or C-entries, correspondingly. Entries have persistent identifiers (similar to DOIs), <i>e.g.</i> <span class="t">S377634</span>, <span class="t">P600028</span>, <span class="t">C100027</span>.</p>
 
-            <p class="view">Another dimension of the MPDS data is the <strong>distinct phases</strong>. The three kinds of entries are interlinked via the distinct materials phases they belong. A tremendous work was done by the PAULING FILE team in the past 20 years to manually distinguish about 200&nbsp;000 inorganic materials phases, appearing in the literature. Each phase has a unique combination of (<i>a</i>) chemical formula, (<i>b</i>) space group, (<i>c</i>) Pearson symbol. Each phase has an integer identifier called <span class="t">phase_id</span>.</p>
+            <p class="view">Another dimension of the MPDS data is the <strong>distinct phases</strong>. The three kinds of entries are interlinked via the distinct materials phases they belong to. Tremendous work has been done by the PAULING FILE team over the past 20 years to manually distinguish about 200&nbsp;000 inorganic materials phases appearing in the literature. Each phase has a unique combination of (<i>a</i>) chemical formula, (<i>b</i>) space group, (<i>c</i>) Pearson symbol. Each phase has a permanent integer identifier called <span class="t">phase_id</span>.</p>
 
             <p class="view">Consider the following example of the <strong>entries</strong> and <strong>distinct phases</strong>. There can be the following distinct phases for the titanium dioxide: rutile with the space group <i>136</i> (let us say, <span class="t">phase_id&nbsp;1</span>), anatase with the space group <i>141</i> (<span class="t">phase_id&nbsp;2</span>), and brookite with the space group <i>61</i> (<span class="t">phase_id&nbsp;3</span>). Then the S- and P-entries for the titanium dioxide must refer to either <span class="t">1</span>, or <span class="t">2</span>, or <span class="t">3</span>, and the C-entries must refer to <span class="t">1</span>, <span class="t">2</span>, and <span class="t">3</span> simultaneously.</p>
 

From 71e71f287ccbb4eec51ee88e6ea4250cf4f77fbf Mon Sep 17 00:00:00 2001
From: Evgeny Blokhin <eb@tilde.pro>
Date: Tue, 20 Sep 2022 22:22:28 +0200
Subject: [PATCH 3/3] Add more comments

---
 kickoff/miner_ab_etransport.py | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/kickoff/miner_ab_etransport.py b/kickoff/miner_ab_etransport.py
index 98bbc03..31ce5ab 100755
--- a/kickoff/miner_ab_etransport.py
+++ b/kickoff/miner_ab_etransport.py
@@ -7,7 +7,7 @@
 
 from etransport_raw import analyze_raw # this is given in the supplied file "etransport_raw.py"
 
-# the raw data on the MPDS are in 7z format
+# the raw simulation data on the MPDS are in 7z format
 # so we need the latest dev version of pylzma
 # pip install git+https://github.com/fancycode/pylzma
 # then py7zlib is available
@@ -19,7 +19,7 @@
 
 for entry in mpds_api.get_data({'props': 'electrical conductivity'}, fields={}):
 
-    archive_url = entry['sample']['measurement'][0]['raw_data'] # this is the raw data archive location
+    archive_url = entry['sample']['measurement'][0]['raw_data'] # this is the raw data archive field in the MPDS JSON P-entries
 
     p = requests.get(archive_url)
     if p.status_code != 200:
@@ -31,7 +31,7 @@
     archive = Archive7z(io.BytesIO(p.content))
     for virtual_path in archive.files:
 
-        if virtual_path.filename != 'TRANSPORT/SIGMA.DAT':
+        if virtual_path.filename != 'TRANSPORT/SIGMA.DAT': # raw simulation output log file
             continue
 
         # this is how we extract data from the 7z-archive
@@ -40,4 +40,4 @@
         result = analyze_raw(rawdata)
         rawdata.seek(0)
 
-        print(entry['sample']['material']['phase'], result)
\ No newline at end of file
+        print(entry['sample']['material']['phase'], result)