The following snippets show the support of PODIO for the different use cases as well as some potential pitfalls. These examples are mainly concerned with how collections of objects and the objects themselves interact. As such they are framework agnostic.
Every data created is either explicitly owned by collections or automatically garbage-collected. There is no need for any new
or delete
call on user side.
As a general rule: If an object has been added to a collection, the collection
assumes ownership and if the collection goes out of scope all handles that point
to objects in this collection are invalidated as well.
Objects and collections can be created via factories, which ensure proper object ownership:
auto hits = HitCollection{};
auto hit1 = hits.create(1.4,2.4,3.7,4.2); // init with values
auto hit2 = hits.create(); // default-construct object
hit2.energy(42.23);
In addition, individual objects can be created in the free. If they aren't attached to a collection, they are automatically garbage-collected:
auto hit1 = Hit();
auto hit2 = Hit();
...
hits.push_back(hit1);
...
<automatic deletion of hit2>
In this respect all objects behave like objects in Python.
The library supports the creation of one-to-many relations:
auto hits = HitCollection{};
auto hit1 = hits.create();
auto hit2 = hits.create();
auto clusters = ClusterCollection{};
auto cluster = clusters.create();
cluster.addHit(hit1);
cluster.addHit(hit2);
The references can be accessed via iterators on the referencing objects
for (auto i = cluster.Hits_begin(), \
end = cluster.Hits_end(); i!=end; ++i){
std::cout << i->energy() << std::endl;
}
or via direct accessors
auto size = cluster.Hits_size();
auto hit = cluster.Hits(<aNumber>);
If asking for an entry outside bounds, a std::out_of_range
exception is thrown.
Looping through collections is supported in two ways. Via iterators:
for(auto i = hits.begin(), end = hits.end(); i != end; ++i) {
std::cout << i->energy() << std::endl;
}
and via direct object access:
for(int i = 0, end = hits.size(), i != end, ++i){
std::cout << hit[i].energy() << std::endl;
}
The notebook pattern
uses the assumption that it is better to create a small
copy of only the data that are needed for a particular calculation. This
pattern is supported by providing access like
auto x_array = hits.x(); // returning all values
auto y_array = hits.y(10); // or only the first 10 elements
The resulting std::vector
can then be used in (auto-)vectorizable code.
Passing in a size argument is optional; If no argument is passed all elements will be returned,
if an argument is passed only as many elements as requested will be returned.
If the collection holds less elements than are requested, only as elements as are available will be returned.
The podio::Frame
is the main container for containing and grouping collections
together. It has two main methods:
/// Store a collection
template<typename T>
const T& put(T&& coll, const std::string& name);
/// access a collection
template<typename T>
const& T get(const std::string& name);
Note that for put
ting collections into the Frame an explicit std::move
is
necessary to highlight the change of ownership that happens in this case.
Collections can be retrieved explicitly:
auto& hits = frame.get<HitCollection>("hits");
if (hits.isValid()) { ... }
Or implicitly when following an object reference. In both cases the access to data that has been retrieved is const
.
Sometimes it is necessary or useful to store additional data that is not directly foreseen in the EDM.
This could be configuration parameters of simulation jobs, or parameter descriptions like cell-ID encoding etc. PODIO currently allows to store such meta data in terms of a GenericParameters
class that
holds an arbitrary number of named parameters of type int, float, string
or vectors if these.
Meta data can be stored and retrieved from the Frame
via the templated putParameter
and getParameter
methods.
The Reader
and Writer
classes in the root_io
and sio_io
submodules
provide all the necessary functionality to read and write event files. An
example of reading files looks like this:
from podio.root_io import Reader
reader = Reader("one or many input files")
for event in reader.get("events"):
hits = store.get("hits")
for hit in hits:
# ...