forked from mhausenblas/5stardata.info
-
Notifications
You must be signed in to change notification settings - Fork 1
/
index.html
238 lines (219 loc) · 12.5 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
<html xmlns="http://www.w3.org/1999/xhtml"
xmlns:xsd ="http://www.w3.org/2001/XMLSchema#"
xmlns:owl="http://www.w3.org/2002/07/owl#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:dcterms="http://purl.org/dc/terms/"
>
<head>
<title>5 star Open Data</title>
<link type="text/css" href="style.css" rel="stylesheet" />
</head>
<body>
<div id="menu"><a href="#by-example">By Example</a> | <a href="#cost-benefits">Costs & Benefits</a> | <a href="#see-also">See Also</a></div>
<h1 property="dcterms:title">5 ★ Open Data</h1>
<p id="about" class="narrow">
<a href="http://www.w3.org/People/Berners-Lee/card#i">Tim Berners-Lee</a>, the inventor of the Web and Linked Data initiator, suggested a <a href="http://www.w3.org/DesignIssues/LinkedData.html">5 star deployment scheme</a> for Open Data. Here, we give examples for each step of the stars and explain costs and benefits that come along with it.
</p>
<div class="img-container">
<img src="5star-steps.png" alt="5-star steps by example" />
</div>
<div class="sep"></div>
<h2 id="by-example">By Example ...</h2>
<p>
Below, we provide examples for each level of Tim's 5 star Open Data plan. The example data used throughout is '<em>the temperature forecast for Galway, Ireland for the next 3 days</em>':
</p>
<table>
<tr>
<td class="stars">★</td>
<td>make your stuff available on the Web (whatever format) under an open license<a class="footnote" href="#addendum1" title="view costs and benefits of 1 star data">1</a></td>
<td><a href="gtd-1.pdf" title="1 star Galway temperature data">example ...</a></td>
</tr>
<tr>
<td class="stars">★★</td>
<td>make it available as structured data (e.g., Excel instead of image scan of a table)<a class="footnote" href="#addendum2" title="view costs and benefits of 2 star data">2</a></td>
<td><a href="gtd-2.xls" title="2 star Galway temperature data">example ...</a></td>
</tr>
<tr>
<td class="stars">★★★</td>
<td>use non-proprietary formats (e.g., CSV instead of Excel)<a class="footnote" href="#addendum3" title="view costs and benefits of 3 star data">3</a></td>
<td><a href="gtd-3.csv" title="3 star Galway temperature data">example ...</a></td>
</tr>
<tr>
<td class="stars">★★★★</td>
<td>use URIs to denote things, so that people can point at your stuff<a class="footnote" href="#addendum4" title="view costs and benefits of 4 star data">4</a></td>
<td><a href="gtd-4.html" title="4 star Galway temperature data">example ...</a></td>
</tr>
<tr>
<td class="stars">★★★★★</td>
<td>link your data to other data to provide context<a class="footnote" href="#addendum5" title="view costs and benefits of 5 star data">5</a></td>
<td><a href="gtd-5.html" title="5 star Galway temperature data">example ...</a></td>
</tr>
</table>
<div class="sep"></div>
<h2 id="cost-benefits">Costs & Benefits ...</h2>
<div class="addendum" id="addendum1">
<div class="leftcol">
<h2>What are the costs & benefits of <span class="stars-inline">★</span> Web data?</h2>
<p>As a consumer ...</p>
<ul class="check">
<li>✔ You can look at it.</li>
<li>✔ You can print it.</li>
<li>✔ You can store it locally (on your hard drive or on an USB stick).</li>
<li>✔ You can enter the data into any other system.</li>
<li>✔ You can change the data as you wish.</li>
<li>✔ You can share the data with anyone you like.</li>
</ul>
<p>As a publisher ...</p>
<ul class="check">
<li>✔ It's simple to publish.</li>
<li>✔ You do not have explain repeatedly to others that they can use your data.</li>
</ul>
</div>
<div class="rightcol">
<div class="squo">“</div>
<p>
It's great to have the data accessible on the Web under an open license (such as <a href="http://www.opendatacommons.org/licenses/pddl/" title="Open Data Commons » Public Domain Dedication and License (PDDL)">PDDL</a>, <a href="http://www.opendatacommons.org/licenses/by/" title="Open Data Commons » Open Data Commons Attribution License">ODC-by</a> or <a href="http://creativecommons.org/publicdomain/zero/1.0/" title="Creative Commons —CC0 1.0 Universal">CC0</a>), however, the data is locked-up in a document.
Other than writing a custom scraper, it's hard to get the data out of the document.
</p>
<div class="equo">”</div>
</div>
<div class="sidecol"></div>
</div>
<div class="addendum" id="addendum2">
<div class="leftcol">
<h2>What are the costs & benefits of <span class="stars-inline">★★</span> Web data?</h2>
<p>As a consumer, you can do all what you can do with <span class="stars-inline">★</span> Web data and additionally:</p>
<ul class="check">
<li>✔ You can directly process it with proprietary software to aggregate it, perform calculations, visualise it, etc.</li>
<li>✔ You can export it into another (structured) format.</li>
</ul>
<p>As a publisher ...</p>
<ul class="check">
<li>✔ It's still simple to publish.</li>
</ul>
</div>
<div class="rightcol">
<div class="squo">“</div>
<p>
Splendid! The data is accessible on the Web in a structured way (that is, machine-readable), however, the data is still locked-up in a document.
To get the data out of the document you depend on proprietary software.
</p>
<div class="equo">”</div>
</div>
<div class="sidecol"></div>
</div>
<div class="addendum" id="addendum3">
<div class="leftcol">
<h2>What are the costs & benefits of <span class="stars-inline">★★★</span> Web data?</h2>
<p>As a consumer, you can do all what you can do with <span class="stars-inline">★★</span> Web data and additionally:</p>
<ul class="check">
<li>✔ You can manipulate the data in any way you like, without being confined by the capabilities of any particular software.</li>
</ul>
<p>As a publisher ...</p>
<ul class="check">
<li>⚠ You might need converters or plug-ins to export the data from the proprietary format.</li>
<li>✔ It's still rather simple to publish.</li>
</ul>
</div>
<div class="rightcol">
<div class="squo">“</div>
<p>
Excellent! The data is not only available via the Web but now everyone can use the data easily.
On the other hand, it's still data on the Web and not <a href="http://webofdata.wordpress.com/2010/03/01/data-and-the-web-choices/" title="Data and the Web – a great many of choices « Web of Data">data in the Web</a>.
</p>
<div class="equo">”</div>
</div>
<div class="sidecol"></div>
</div>
<div class="addendum" id="addendum4">
<div class="leftcol">
<h2>What are the costs & benefits of <span class="stars-inline">★★★★</span> Web data?</h2>
<p>As a consumer, you can do all what you can do with <span class="stars-inline">★★★</span> Web data and additionally:</p>
<ul class="check">
<li>✔ You can link to it from any other place (on the Web or locally).</li>
<li>✔ You can bookmark it.</li>
<li>✔ You can reuse parts of the data.</li>
<li>✔ You may be able to reuse existing tools and libraries, even if they only understand parts of the pattern the publisher used.</li>
<li>⚠ Understanding the structure of an RDF "Graph" of data can be more effort than tabular (Excel/CSV) or tree (XML/JSON) data.</li>
<li>✔ You can combine the data safely with other data. URIs are a global scheme so if two things have the same URI then it's intentional, and if so that's well on it's way to being 5 star data!</li>
</ul>
<p>As a publisher ...</p>
<ul class="check">
<li>✔ You have fine-granular control over the data items and can optimise their access (load balancing, caching, etc.)</li>
<li>✔ Other data publishers can now link into your data, promoting it to 5 star!</li>
<li>⚠ You typically invest some time slicing and dicing your data.</li>
<li>⚠ You'll need to assign URIs to data items and think about how to represent the data.</li>
<li>⚠ You need to either find existing patterns to reuse or create your own.</li>
</ul>
</div>
<div class="rightcol">
<div class="squo">“</div>
<p>
Wonderful! Now it's data <strong>in</strong> the Web. The (most important) data items have a URI and can be shared on the Web.
A native way to represent the data is using RDF, however other formats such as Atom can be converted/mapped, if required.
</p>
<div class="equo">”</div>
</div>
<div class="sidecol"></div>
</div>
<div class="addendum" id="addendum5">
<div class="leftcol">
<h2>What are the costs & benefits of <span class="stars-inline">★★★★★</span> Web data?</h2>
<p>As a consumer, you can do all what you can do with <span class="stars-inline">★★★★</span> Web data and additionally:</p>
<ul class="check">
<li>✔ You can discover more (related) data while consuming the data.</li>
<li>✔ You can directly learn about the data schema.</li>
<li>⚠ You now have to deal with broken data links, just like 404 errors in web pages.</li>
<li>⚠ Presenting data from an arbitrary link as fact is as risky as letting people include content from any website in your pages. Caution, trust and common sense are all still necessary.</li>
</ul>
<p>As a publisher ...</p>
<ul class="check">
<li>✔ You make your data discoverable.</li>
<li>✔ You increase the value of your data.</li>
<li>✔ You own organisation will gain the same benefits from the links as the consumers.</li>
<li>⚠ You'll need to invest resources to link your data to other data on the Web.</li>
<li>⚠ You may need to repair broken or incorrect links.</li>
</ul>
</div>
<div class="rightcol">
<div class="squo">“</div>
<p>
Brilliant! Now it's data, <strong>in</strong> the Web <strong>linked to</strong> other data. Both the consumer and the publisher
benefit from the <a href="http://en.wikipedia.org/wiki/Network_effect" title="Network effect - Wikipedia, the free encyclopedia">network effect</a>.
</p>
<div class="equo">”</div>
</div>
<div class="sidecol"></div>
</div>
<div class="sep"></div>
<h2 id="see-also">See Also</h2>
<ul>
<li>
<a href="http://inkdroid.org/ehs">Ed Summers</a> has provided a <a href="http://inkdroid.org/journal/2010/06/04/the-5-stars-of-open-linked-data/" title="the 5 stars of open linked data">nice rendering</a> of the 5 star scheme that we're building upon here.
</li>
<li>If you have Open Data, you might want to use the <a href="http://lab.linkeddata.deri.ie/2010/lod-badges/">star badges</a> to flag your compliance level.</li>
<li>James G. Kim provided a <a href="http://5stardata.info/kr/" title="별점 5 오픈 데이터">Korean translation</a> of this site.</li>
<li>Christopher Gutteridge has a <a href='http://openorg.ecs.soton.ac.uk/wiki/Linked_Data_Basics_for_Techies'>Linked Data crash course for programmers</a>.</li>
</ul>
<div id="ack">
<p>
Kudos to <a href="http://twitter.com/andyseaborne">Andy Seaborne</a> for pointing out the CSV bug,
to <a href="http://twitter.com/kerfors">Kerstin Forsberg</a> for suggesting the 'data highlighting' in the 4/5 star examples,
as well as to <a href="http://twitter.com/vassilios">Vassilios Peristeras</a> for proposing to explain not only
the 'what' but also the 'why'. Thanks to <a href="https://twitter.com/egonwillighagen">Egon Willighagen</a> for providing more details about benefits of one-star data. Additional contributions from <a href='http://id.ecs.soton.ac.uk/person/1248'>Christopher Gutteridge</a>.
This site is brought to you by the EC FP7 Support Action LOD-Around-The-Clock (<a href="http://latc-project.eu/">LATC</a>).
</p>
</div>
<div id="footer">
<span>
Last update: <span property="dcterms:date" datatype="xsd:date">2012-04-03</span> by <a rel="dcterms:creator" href="http://sw-app.org/mic.xhtml#i">Michael</a> |
</span>
<span>
Code available via <a href="https://github.com/mhausenblas/5stardata.info">GitHub</a>
</span>
</div>
</body>
</html>