Commit

TOC
sampsyo committed Jul 5, 2023
1 parent 6af21e8 commit 210f7d1
Showing 2 changed files with 101 additions and 0 deletions.
97 changes: 97 additions & 0 deletions OpenTOC/exhet23.html
@@ -0,0 +1,97 @@
<html xmlns:bkstg="http://www.atypon.com/backstage-ns" xmlns:urlutil="java:com.atypon.literatum.customization.UrlUtil" xmlns:pxje="java:com.atypon.frontend.services.impl.PassportXslJavaExtentions"><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"><meta http-equiv="Content-Style-Type" content="text/css"><style type="text/css">
#DLtoc {
font: normal 12px/1.5em Arial, Helvetica, sans-serif;
}

#DLheader {
}
#DLheader h1 {
font-size:16px;
}

#DLcontent {
font-size:12px;
}
#DLcontent h2 {
font-size:14px;
margin-bottom:5px;
}
#DLcontent h3 {
font-size:12px;
padding-left:20px;
margin-bottom:0px;
}

#DLcontent ul{
margin-top:0px;
margin-bottom:0px;
}

.DLauthors li{
display: inline;
list-style-type: none;
padding-right: 5px;
}

.DLauthors li:after{
content:",";
}
.DLauthors li.nameList.Last:after{
content:"";
}

.DLabstract {
padding-left:40px;
padding-right:20px;
display:block;
}

.DLformats li{
display: inline;
list-style-type: none;
padding-right: 5px;
}

.DLformats li:after{
content:",";
}
.DLformats li.formatList.Last:after{
content:"";
}

.DLlogo {
vertical-align:middle;
padding-right:5px;
border:none;
}

.DLcitLink {
margin-left:20px;
}

.DLtitleLink {
margin-left:20px;
}

.DLotherLink {
margin-left:0px;
}

</style><title>ExHET 23:: Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions</title></head><body><div id="DLtoc"><div id="DLheader"><h1>ExHET 23:: Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions</h1><a class="DLcitLink" title="Go to the ACM Digital Library for additional information about this proceeding" referrerpolicy="no-referrer-when-downgrade" href="https://dl.acm.org/doi/proceedings/10.1145/3587278"><img class="DLlogo" alt="Digital Library logo" height="30" src="https://dl.acm.org/specs/products/acm/releasedAssets/images/footer-logo1.png">
Full Citation in the ACM Digital Library
</a></div><div id="DLcontent">
<h3><a class="DLtitleLink" title="Full Citation in the ACM Digital Library" referrerpolicy="no-referrer-when-downgrade" href="https://dl.acm.org/doi/10.1145/3587278.3595642">Tiling Framework for Heterogeneous Computing of Matrix based Tiled Algorithms</a></h3><ul class="DLauthors"><li class="nameList">Narasinga Rao Miniskar</li><li class="nameList">Mohammad Alaul Haque Monil</li><li class="nameList">Pedro Valero-Lara</li><li class="nameList">Frank Liu</li><li class="nameList Last">Jeffrey S. Vetter</li></ul><div class="DLabstract"><div style="display:inline">
<p>Tiling matrix operations can improve the load balancing and performance of applications on heterogeneous computing resources. Writing a tile-based algorithm for each operation with a traditional, hand-tuned tiling approach that uses <em>for</em> loops in C/C++ is cumbersome and error prone. Moreover, it must enable and support the heterogeneous memory management of data objects and also explore architecture-supported, native, tiled-data transfer APIs instead of copying the tiled data to continuous memory before the data transfer. The tiling framework provides a tiled data structure for heterogeneous memory mapping and parameterization to a heterogeneous task specification API. We have integrated our tiled framework into MatRIS (Math kernels library using IRIS). IRIS is a heterogeneous run-time framework with a heterogeneous programming model, memory model, and task execution model. Experiments reveal that the tiled framework for BLAS operations has improved the programmability of tiled BLAS and improved performance by ~20% when compared against the traditional method that copies the data to continuous memory locations for heterogeneous computing.</p>
</div></div>


<h3><a class="DLtitleLink" title="Full Citation in the ACM Digital Library" referrerpolicy="no-referrer-when-downgrade" href="https://dl.acm.org/doi/10.1145/3587278.3595644">Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation</a></h3><ul class="DLauthors"><li class="nameList">Gaurav Verma</li><li class="nameList">Siddhisanket Raskar</li><li class="nameList">Zhen Xie</li><li class="nameList">Abid M Malik</li><li class="nameList">Murali Emani</li><li class="nameList Last">Barbara Chapman</li></ul><div class="DLabstract"><div style="display:inline">
<p>Tuning tensor program generation involves searching for various possible program transformation combinations for a given program on target hardware to optimize the tensor program execution. It is already a complex process because of the massive search space, and exponential combinations of transformations make auto-tuning tensor program generation more challenging, especially when we have a heterogeneous target. In this research, we attempt to address these problems by learning the joint neural network and hardware features and transferring them to the new target hardware. We extensively study the existing state-of-the-art dataset, TenSet, perform comparative analysis on the test split strategies and propose methodologies to prune the dataset. We adopt an attention-inspired approach for tuning the tensor programs enabling them to embed neural network and hardware-specific features. Our approach could prune the dataset up to 45% of the baseline without compromising the Pairwise Comparison Accuracy (PCA). Further, the proposed methodology can achieve on-par or improved mean inference time with 25%-40% of the baseline tuning time across different networks and target hardware.</p>
</div></div>


<h3><a class="DLtitleLink" title="Full Citation in the ACM Digital Library" referrerpolicy="no-referrer-when-downgrade" href="https://dl.acm.org/doi/10.1145/3587278.3595645">Harnessing Extreme Heterogeneity for Ocean Modeling with Tensors</a></h3><ul class="DLauthors"><li class="nameList">Li Tang</li><li class="nameList">Philip Jones</li><li class="nameList Last">Scott Pakin</li></ul><div class="DLabstract"><div style="display:inline">
<p>Specialized processors designed to accelerate tensor operations are evolving faster than conventional processors. This trend of architectural innovations greatly benefits artificial intelligence (AI) workloads. However, it is unknown how well AI-optimized accelerators can be retargeted to scientific applications. To answer this question we explore (1) whether a typical scientific modeling kernel can be mapped efficiently to tensor operations and (2) whether this approach is portable across diverse processors and AI accelerators. In this paper we implement two versions of tracer advection in an ocean-modeling application using PyTorch and evaluate these on one CPU, two GPUs, and Google's TPU. Our findings are that scientific modeling can observe both a performance boost and improved portability by mapping key computational kernels to tensor operations.</p>
</div></div>

</div></div></body></html>
4 changes: 4 additions & 0 deletions _data/OpenTOC.yaml
@@ -1277,3 +1277,7 @@
event: LCTES
year: 2023
title: "Proceedings of the 24th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems"
-
event: ExHET
year: 2023
title: "Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions"
