Presenter Index Full Program · Presenters · Organizations · Search Program - A B C D E F G H I J K L M N O P Q R S T U V W X Y Z - Mark Duffield, Kees Vissers, Oliver Gunasekara - Amazon Web Services, Xilinx Inc, NGCodec Keynote: FPGAs in AWS and First Use Cases (joint talk by AWS, NGcodec, and Xilinx) Return to Top A Omar Aaziz New Mexico State University P73: HPC Production Job Quality Assessment Moustafa AbdelBaky Rutgers University Submarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced Cyberinfrastructure David Abdurachmanov CERN The ARM Software Ecosystem: Are We There Yet? David Abramson University of Queensland Scalable Distributed Infrastructure for Data Intensive Science Bilge Acun University of Illinois Mitigating Variability in HPC Systems and Applications for Performance and Power Efficiency Ross N. Adelman US Army Research Laboratory P39: Extremely Large, Wide-Area Power-Line Models Ferrol Aderholdt Oak Ridge National Laboratory P59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing Environments Vikram Adve University of Illinois Developing an OpenMP Runtime for UVM-Capable GPUs Heterogeneous Parallel Virtual Machine and Parallelism in LLVM Ilya Afanasyev Lomonosov Moscow State University Five-minute presentations by young researchers from around the world - part 2 Hoda Aghaei Khouzani University of Delaware Runtime Solutions to Apply Non-Volatile Memories in Future Computer Systems Danial Aghajarian Georgia State University A Heterogeneous HPC Platform for Ill-Structured Spatial Join Processing Abhinav Agrawal North Carolina State University Leveraging Near Data Processing for High-Performance Checkpoint/Restart Kunal Agrawal Washington University in St. Louis Keynote: Teaching Sound Principles and Good Practices for Parallel Algorithms. Khalid Ahmad University of Utah Automatic Testing of OpenACC Applications Dong H. Ahn Lawrence Livermore National Laboratory P94: Fully Hierarchical Scheduling: Paving the Way to Exascale Workloads Behzad R. Ahrabi University of Wyoming P28: High-Fidelity Blade-Resolved Wind Plant Modeling James Ahrens Los Alamos National Laboratory Cosmological Particle Data Compression in Practice P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView Alex Aiken Stanford University Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions The Legion Programming Model Mark Ainsworth Brown University Introduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2) MGARD: A Multilevel Technique for Compression of Floating-Point Data The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2) Asma H. Al-rawi Intel Corporation P95: GEOPM: A Scalable Open Runtime Framework for Power Management Sadaf Alam Swiss National Supercomputing Centre How Serious Are We About the Convergence Between HPC and Big Data? Best Practices for Architecting Performance and Capacity in the Burst Buffer Era Interactivity in Supercomputing Jay Alameda University of Illinois Fourth SC Workshop on Best Practices for HPC Training Carl Albing US Naval Academy Fourth SC Workshop on Best Practices for HPC Training Ben Albrecht Cray Inc Cosmological Particle-Mesh Simulations in Chapel Nia Alexandrov Hartree Centre Fourth SC Workshop on Best Practices for HPC Training Vassil Alexandrov Barcelona Supercomputing Center Fourth SC Workshop on Best Practices for HPC Training Introduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems Invited Talk - On Improved Monte Carlo Hybrid Methods for Preconditioner Computations 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems Yuri Alexeev Argonne National Laboratory An Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi Processor P42: TRIP: An Ultra-Low Latency, TeraOps/s Reconfigurable Inference Processor for Multi-Layer Perceptrons P30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon Phi P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry Momme Allalen Leibniz Supercomputing Centre P08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.II Graham Allan University of Minnesota P60: Managing dbGaP Data with Stratus, a Research Cloud for Protected Data William Allcock Argonne National Laboratory P32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDs Randy Allen Mentor Graphics The Challenges Faced by OpenACC Compilers Tyler Allen Clemson University Performance and Energy Usage of Workloads on KNL and Haswell Architectures Amani Alonazi King Abdullah University of Science and Technology Five-minute presentations by young researchers from around the world - part 2 Ilkay Altintas San Diego Supercomputer Center A Machine Learning Approach for Modular Workflow Performance Prediction Alper Altuntas National Center for Atmospheric Research Verifying Concurrency in an Adaptive Ocean Circulation Model Rommie Amaro University of California, San Diego Molecular Simulation at the Mesoscale Marcos Amarís University of Sao Paulo Performance Prediction Modeling of GPU Applications Abdelhalim Amer Argonne National Laboratory Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 MPICH: A High-Performance Open-Source MPI Implementation Sadika Amreen University of Tennessee Position Paper: Experiences on Clustering High-Dimensional Data Using pbdR Amit Amritkar University of Houston Vistas in Advanced Computing Jefferson Amstutz Intel Corporation Flexible In Situ Visualization of LAMMPS Simulations Jooneun An Korea Institute of Science and Technology Information Visualization of Decision-Making Support (DMS) Information for Responding to a Typhoon-Induced Disaster James Ang Sandia National Laboratories Exascale Challenges and Opportunities Rushil Anirudh Lawrence Livermore National Laboratory Performance Modeling under Resource Constraints Using Deep Transfer Learning P75: Model-Agnostic Influence Analysis for Performance Data Katie Antypas National Energy Research Scientific Computing Center How Serious Are We About the Convergence Between HPC and Big Data? Parallel I/O in Practice Hartwig Antz Karlsruhe Institute of Technology University of Tennessee Overcoming Load Imbalance for Irregular Sparse Matrices Hartwig Anzt University of Tennessee Karlsruhe Institute of Technology Flexible Batched Sparse Matrix-Vector Product on GPUs Yulong Ao Chinese Academy of Sciences Five-minute presentations by young researchers from around the world - part 1 Takayuki Aoki Tokyo Institute of Technology Hybrid Fortran: High Productivity GPU Porting Framework Applied to Japanese Weather Prediction Model David Appelhans IBM Leveraging NVLINK and Asynchronous Data Transfer to Scale Beyond the Memory Capacity of GPUs Charles Archer Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Manuel Arenaz University of A Coruña and Appentra Solutions Parallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACC Dorian Arnold Emory University Silent Errors in HPC Systems Experiencing HPC for Undergraduates: Careers in HPC Forming Strong Networks and Collaborations Connections II: Connecting with Mentors Forming Connections I: Connecting Sideways, with Ourselves and Our Peers Yuuichi Asahi French Alternative Energies and Atomic Energy Commission Application of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore Platforms Mitsuteru Asai Kyushu University P21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion Particles Mark Asch University of Picardie Total SA Big Data and Exascale Computing (BDEC) Community Report Samar Aseeri King Abdullah University of Science and Technology A Comparison of Distributed Memory Fast Fourier Transform (FFT) Library Packages Rafael Asenjo University of Malaga Expressing Heterogeneous Parallelism in C++ with Intel Threading Building Blocks Thomas Ashby IMEC P62: How To Do Machine Learning on Big Clusters Joshua Asplund Lawrence Livermore National Laboratory DataRaceBench: A Benchmark Suite for Systematic Evaluation of Data Race Detection Tools Danny Auble SchedMD LLC Slurm User Group Meeting Guillaume Aupy French Institute for Research in Computer Science and Automation (INRIA) Periodic I/O Scheduling for Supercomputers Brian Austin Lawrence Berkeley National Laboratory Performance and Energy Usage of Workloads on KNL and Haswell Architectures Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Jeff Autor Hewlett Packard PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and Control Ammar Ahmad Awan Ohio State University An In-Depth Performance Characterization of CPU- and GPU-Based DNN Training on Modern Architectures A26: Co-Designing MPI Runtimes and Deep Learning Frameworks for Scalable Distributed Training on GPU Clusters Abdulrahman Azab University of Oslo Partnership for Advanced Computing in Europe (PRACE) Containers in HPC Return to Top B Abdel-Hameed Badawy New Mexico State University A Scalable Analytical Memory Model for CPU Performance Prediction David Bader Georgia Institute of Technology 15th Graph500 List Michael Bader Technical University Munich Extreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust Earthquake Materials and Chemistry Frank Baetke Hewlett Packard Enterprise BeeGFS - Architecture, Implementation Examples, and Future Development Lustre Community BoF: Lustre Deployments for the Next 5 Years Saurabh Bagchi Purdue University Snowpack: Efficient Parameter Choice for GPU Kernels via Static Analysis and Statistical Prediction Anna Maria Bailey Lawrence Livermore National Laboratory Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Seonmyeong Bak University of Illinois Integrating OpenMP into the Charm++ Programming Model Allison H. Baker National Center for Atmospheric Research Quality Assurance and Error Identification for the Community Earth System Model Brandon Baker Intel Corporation P95: GEOPM: A Scalable Open Runtime Framework for Power Management Jason Bakos University of South Carolina Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic Pavan Balaji Argonne National Laboratory Workshop on Exascale MPI (ExaMPI) Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Advanced MPI Programming MPICH: A High-Performance Open-Source MPI Implementation Prasanna Balaprakash Argonne National Laboratory Contemporary Design of Supercomputer Experiments Rahul Bale RIKEN P24: A Deployment of HPC Algorithm into Pre/Post-Processing for Industrial CFD on K-Computer Gabor Daniel Balogh Pazmany Peter Catholic University Comparison of Parallelization Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs Daniel Balouek-Thomert Rutgers University Submarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced Cyberinfrastructure Fabio Banchelli Barcelona Supercomputing Center P71: Is ARM Software Ecosystem Ready for HPC? Kunal Banerjee Intel Corporation P31: Understanding the Performance of Small Convolution Operations for CNN on Intel Architecture Purushotham Bangalore University of Alabama, Birmingham Workshop on Exascale MPI (ExaMPI) P45: Campaign Storage: Erasure Coding with GPUs Neelofer Banglawala University of Edinburgh Women in HPC: Non-Traditional Paths to HPC and How They Can and Do Enrich the Field Lorena Barba George Washington University HPC Software: Is “Cool Stuff” Really Incompatible with Sustainability? Deborah Bard Lawrence Berkeley National Laboratory Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Getting Started with the Burst Buffer: Using DataWarp Technology Ashley Barker Oak Ridge National Laboratory Small Business and the Exascale Computing Project Kevin Barker Pacific Northwest National Laboratory Energy Efficient Supercomputing (E2SC) Verification of the Extended Roofline Model for Asynchronous Many Task Runtimes P99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big Data Martina Barnas Indiana University Introduction - Workshop on Education for High Performance Computing (EduHPC) Panel: Attracting Women and Underrepresented Minorities to HPC and Data Science Thomas Barr Research Institute at Nationwide Children's Hospital Computational Approaches for Cancer Carlos Jaime Barrios Hernandez Advanced Computing Service for Latin America and the Caribbean Industrial University of Santander Americas HPC Collaboration Andrea Bartolini ETH Zurich P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers Chaitanya Baru National Science Foundation Common Big Data Challenges in Bio, Geo, Climate, and Social Sciences Alexey Bataev IBM Implementing Implicit OpenMP Data Sharing on GPUs Natalie Bates Energy Efficient HPC Working Group Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) The Green500: Trends in Energy-Efficient Supercomputing Total Cost of Ownership and HPC System Procurement State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers Andrew Bauer Kitware Inc In Situ Summarization with VTK-m In Situ Analysis and Visualization with SENSEI Michael Bauer Nvidia Corporation Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions John Baugh North Carolina State University Verifying Concurrency in an Adaptive Ocean Circulation Model Mohammadreza Bayatpour Ohio State University Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design Alexandre Bayen Lawrence Berkeley National Laboratory University of California, Berkeley Inference and Control in Routing Games Neelima Bayyapu Argonne National Laboratory MPICH: A High-Performance Open-Source MPI Implementation Daniel Beall Naval Research Laboratory P18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial Consortia Scott Beamer Lawrence Berkeley National Laboratory Research Execution Lee Beausoleil US Department of Defense Panel Discussion: Diversifying the HPC workforce Identifying the Roadblocks Facing Women in your Workforce Fabian Beck University of Duisburg-Essen Introduction - 4th International Workshop on Visual Performance Analytics – VPA 2017 Fourth International Workshop on Visual Performance Analysis – VPA 2017 Gregory Becker Lawrence Livermore National Laboratory Managing HPC Software Complexity with Spack David Beckingsale Lawrence Livermore National Laboratory P76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for Kernels Pete Beckman Director, Exascale Technology & Computing Institute Argonne National Laboratory Common Big Data Challenges in Bio, Geo, Climate, and Social Sciences HPC Connects Plenary: The Century of the City Cross-Layer Allocation and Management of Hardware Resources in Shared Memory Nodes Big Data and Exascale Computing (BDEC) Community Report The Internet of Things and HPC: Are They Teaming Up to Work Together? Bradford M. Beckmann Advanced Micro Devices Inc Gravel: Fine-Grain GPU-Initiated Network Messages Izaak Beekman ParaTools P04: Unstructured-Grid CFD Algorithms on Many-Core Architectures Oceane Bel University of California, Santa Cruz CAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning P65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement Learning Kellon Belfon Stony Brook University Experiencing HPC for Undergraduates: Graduate Student Perspective Maxim Belkin University of Illinois Fourth SC Workshop on Best Practices for HPC Training Software Engineering and Reuse in Computational Science and Engineering Gordon Bell Microsoft Thirty Years of the Gordon Bell Prize Francis Belot Atomic Energy and Alternative Energies Commission State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) Anouar Benali Argonne National Laboratory Embracing a New Era of Highly Efficient and Productive Quantum Monte Carlo Simulations Siegfried Benkner University of Vienna Extending the Open Community Runtime with External Application Support John Bent Seagate Government Solutions The Virtual Institute of I/O and the IO-500 Michael Bentley University of Utah A15: Quantifying Compiler Effects on Code Performance and Reproducibility Using FLiT Brad Benton Advanced Micro Devices Inc GPU Triggered Networking for Intra-Kernel Communications Pavel Benáček CESNET Case Study: Usage of High Level Synthesis in HPC Networking Gheorghe-Teodor Bercea IBM Implementing Implicit OpenMP Data Sharing on GPUs Ben Bergen Los Alamos National Laboratory P63: FleCSPH: a Parallel and Distributed Smoothed Particle Hydrodynamics Framework Based on FleCSI Karen Bergman Columbia University Post Moore Supercomputing Francine Berman Rensselaer Polytechnic Institute Blurring the Lines: High-End Computing and Data Science David Bernholdt Oak Ridge National Laboratory OpenMP 4.5 Validation and Verification Suite Better Scientific Software Software Engineering and Reuse in Computational Science and Engineering Carlo Bertolli IBM Implementing Implicit OpenMP Data Sharing on GPUs Colleen Bertoni Argonne National Laboratory P32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDs Martin Berzins University of Utah Scientific Computing and Imaging Institute Addressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUs Maciej Besta ETH Zurich Scaling Betweenness Centrality Using Communication-Efficient Sparse Matrix Multiplication E. Wes Bethel Lawrence Berkeley National Laboratory Introduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization In Situ Analysis and Visualization with SENSEI ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization Blair Bethwaite Monash University OpenStack For HPC: Best Practices for Optimizing Software-Defined Infrastructure Eugen Betke German Climate Computing Center P57: Adaptive Tier Selection for NetCDF and HDF5 P15: Toward Decoupling the Selection of Compression Algorithms from Quality Constraints Joshua Bevan University of Illinois P20: Facilitating the Scalability of ParSplice for Exascale Testbeds Sridutt Bhalachandra University of North Carolina Using Runtime Energy Optimizations to Improve Energy Efficiency in High Performance Computing Siddharth Bhat International Institute of Information Technology, Hyderabad Optimizing Geometric Multigrid Method Computation Using a DSL Approach Abhinav Bhatele Lawrence Livermore National Laboratory Introduction - 4th International Workshop on Visual Performance Analytics – VPA 2017 ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data Performance Modeling under Resource Constraints Using Deep Transfer Learning Predicting the Performance Impact of Different Fat-Tree Configurations Fourth International Workshop on Visual Performance Analysis – VPA 2017 Wahid Bhimji Lawrence Berkeley National Laboratory Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data Paolo Bientinesi RWTH Aachen University A01: GEMM-Like Tensor-Tensor Contraction (GETT) Amanda J. Bienz University of Illinois Reducing Communication Costs in the Parallel Algebraic Multigrid Jay Jay Billings Oak Ridge National Laboratory Software Engineers: Careers in Research Robert Bird Los Alamos National Laboratory A Scalable Analytical Memory Model for CPU Performance Prediction George Biros University of Texas Geometry-Oblivious FMM for Compressing Dense SPD Matrices A Framework for Scalable Biophysics-Based Image Analysis Sean Blanchard Los Alamos National Laboratory Experimental and Analytical Study of Xeon Phi Reliability Wesley Bland Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Gil Bloch Mellanox Technologies Accelerating Big Data Processing and Machine/Deep Learning Middleware on Modern HPC Clusters Michael Blocksome Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Michaela Blott Xilinx Inc Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic David Bock National Center for Supercomputing Applications, University of Illinois Simulation and Visual Representation of Tropical Cyclone-Ocean Interactions Christian Bodenstein Research Center Juelich Supporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework François Bodin University of Rennes European Exascale Projects and Their Global Contributions David Boehme Lawrence Livermore National Laboratory Predicting the Performance Impact of Different Fat-Tree Configurations Stanislav Bohm Technical University of Ostrava P62: How To Do Machine Learning on Big Clusters Taisuke Boku University of Tsukuba Runtime Correctness Checking for Emerging Programming Paradigms Barry Bolding Cray Inc How Serious Are We About the Convergence Between HPC and Big Data? Evan F. Bollig University of Minnesota P60: Managing dbGaP Data with Stratus, a Research Cloud for Protected Data Rosie Bolton Square Kilometre Array Life, the Universe and Computing: The Story of the SKA Telescope Uday Bondhugula Indian Institute of Science Optimizing Geometric Multigrid Method Computation Using a DSL Approach Matthias Book University of Iceland Supporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework Utpal Bora International Institute of Information Technology, Hyderabad Improved Loop Distribution in LLVM Using Polyhedral Dependences Ralph C. Bording Pawsey Supercomputing Centre 4th International Workshop on HPC User Support Tools (HUST-17) HPC Carpentry - Practical, Hands-On HPC Training Andrea Borghesi University of Bologna State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers Kalina M. Borkiewicz National Center for Supercomputing Applications, University of Illinois Milky Way Analogue Isolated Disk Galaxy First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe George Bosilca University of Tennessee Dynamic Task Discovery in PaRSEC- A Data-Flow Task-Based Runtime Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice Resilient Programming Environments Open MPI State of the Union XI Charles A. Bouman Purdue University Massively Parallel 3D Image Reconstruction Aurélien Bouteiller University of Tennessee Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice Anne Dara Bowen Texas Advanced Computing Center, University of Texas Physical Signatures of Cancer Metastasis Geoffrey C. Bower Academica Sinica Institute of Astronomy and Astrophysics realfast@VLA Eric Boyer GENCI Total Cost of Ownership and HPC System Procurement Andrew M. Bradley Sandia National Laboratories Designing Vector-Friendly Compact BLAS and LAPACK Kernels Jim Brandt Sandia National Laboratories HPC Systems Monitoring Data in Action Steven R. Brandt Louisiana State University Interactive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel Code HPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave API David Brayford Leibniz Supercomputing Centre OpenHPC Community BoF Michael J. Brazell University of Wyoming P28: High-Fidelity Blade-Resolved Wind Plant Modeling Marisa Brazil Purdue University Building a Community: Outreach Strategies for Coordinating a Local WHPC Program Panel Discussion: Diversifying the HPC workforce Peer-Timo Bremer Lawrence Livermore National Laboratory ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data Ronny Brendel Oak Ridge National Laboratory An LLVM Instrumentation Plug-In for Score-P Mauricio Breternitz University Institute of Lisbon GPU Triggered Networking for Intra-Kernel Communications Alys Brett Culham Centre for Fusion Energy Software Engineering and Reuse in Computational Science and Engineering Software Engineers: Careers in Research Sven Breuner ThinkParQ GmbH BeeGFS - Architecture, Implementation Examples, and Future Development John Brevik California State University, Long Beach Probabilistic Guarantees of Execution Duration for Amazon Spot Instances Patrick Bridges University of New Mexico Workshop on Exascale MPI (ExaMPI) Ian Briggs University of Utah P84: PRESAGE: Selective Low Overhead Error Amplification for Easy Detection Ron Brightwell Sandia National Laboratories Workshop on Exascale MPI (ExaMPI) Opening Remarks: MCHPC'17: Workshop on Memory Centric Programming for HPC sPIN: High-Performance Streaming Processing in the Network MCHPC2017: Workshop on Memory Centric Programming for HPC André Brinkmann Johannes Gutenberg University Mainz A Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File System Jed Brown University of Colorado, Boulder Contemporary Design of Supercomputer Experiments Maxine Brown University of Illinois, Chicago SAGE2 9th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration Nick Brown University of Edinburgh From Outreach to Education to Researcher: Innovative Ways of Expanding the HPC Community Panel Discussion: Diversifying the HPC workforce From Outreach to Education to Researcher - Innovative Ways of Expanding the HPC Community P81: Offloading Python Kernels to Micro-Core Architectures Dana Brunson Oklahoma State University Fourth SC Workshop on Best Practices for HPC Training Kris Bubendorfer Victoria University of Wellington Heuristic Dynamic Workflow Scheduling Ronak Buch University of Illinois Migratable Objects and Task-Based Parallel Programming with Charm++ Robert Budden Pittsburgh Supercomputing Center OpenStack For HPC: Best Practices for Optimizing Software-Defined Infrastructure Reuben Budiardja Oak Ridge National Laboratory Regression Testing and Monitoring Tools Zoran Budimlic Rice University Graph500 on OpenSHMEM: Using a Practical Survey of Past Work to Motivate Novel Algorithmic Developments Gina Bullock North Carolina Agricultural and Technical State University Teaching, Learning and Collaborating through Cloud Computing Online Classes Aydin Buluc Lawrence Berkeley National Laboratory University of California, Berkeley Scaling Deep Learning on GPU and Knights Landing Clusters HPC Graph Toolkits and the GraphBLAS Forum Communication Efficient Methods David Bunde Knox College "Peachy Assignments:" A New Edu* Conference Component Hans-Joachim Bungartz Technical University Munich A Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma Simulations Citius, Altius, Fortius! Sarah Burke-Spolaor West Virginia University realfast@VLA Anastasiia Butko Lawrence Berkeley National Laboratory Workshop for Open Source Supercomputing Bryan J. Butler National Radio Astronomy Observatory realfast@VLA Ali R. Butt Virginia Tech TagIt: An Integrated Indexing and Search Service for File Systems Suren Byna Lawrence Berkeley National Laboratory In-System Processing for Performance Vetria Byrd Clemson University Scientific Visualization & Data Analytics Showcase Posters Scientific Visualization and Data Analytics Showcase Posters Scientific Visualization & Data Analytics Showcase Posters Scientific Visualization & Data Analytics Showcase Reception Return to Top C Katharine Cahill Ohio Supercomputer Center A Proposed Model for Teaching Advanced Parallel Computing and Related Topics Blake Caldwell University of Colorado, Boulder P59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing Environments Rebecca Caldwell Winston-Salem State University Teaching, Learning and Collaborating through Cloud Computing Online Classes Patrice Calegari Bull From HPC-as-a-Service to Deep Learning-as-a-Service Gruia Calinescu Illinois Institute of Technology P12: Multi-Size Optional Offline Caching Algorithms Martin Callaghan University of Leeds HPC Carpentry - Practical, Hands-On HPC Training Scott Callaghan University of Southern California Panel Discussion: Diversifying the HPC workforce The Benefits of Mentoring: Why and How to Set Up a Program rvGAHP – Push-Based Job Submission Using Reverse SSH Connections From Outreach to Education to Researcher - Innovative Ways of Expanding the HPC Community Spencer Callicott Mississippi State University A14: Analysis of Synthetic Graph Generation Methods for Directed Network Graphs Kirk Cameron Virginia Tech Energy Efficient Supercomputing (E2SC) Funding Agencies HPC Impact Showcase: Computational Modeling Andrew Canning Lawrence Berkeley National Laboratory P13: Large-Scale GW Calculations on Pre-Exascale HPC Systems Shane Canon Lawrence Berkeley National Laboratory Container Computing for HPC and Scientific Workflows Containers in HPC Christopher M. Cantalupo Intel Corporation P95: GEOPM: A Scalable Open Runtime Framework for Power Management Franck Cappello Argonne National Laboratory Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic Compression of Scientific Data Reconfigurable Computing in Exascale P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry Emerging Technologies Showcase (Day 3) H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic Emerging Technologies Showcase (Day 1) Emerging Technologies Showcase (Day 2) Danilo Carastan-Santos Federal University of ABC, Santo André, Brazil University of Grenoble Obtaining Dynamic Scheduling Policies with Simulation and Machine Learning Experiencing HPC for Undergraduates: Graduate Student Perspective Lawrence Carin Duke University Introduction - Machine Learning in HPC Environments Richard Carlson US Department of Energy Small Business and the Exascale Computing Project William Carlson Institute for Defense Analyses Keynote: Shared Memory HPC Programming: Past, Present and Future PGAS Applications Workshop Panel Marcelo Amaral Barcelona Supercomputing Center Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments Philip Carns Argonne National Laboratory Analyzing Parallel I/O Jeffrey D. Carpenter National Center for Supercomputing Applications, University of Illinois Milky Way Analogue Isolated Disk Galaxy First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe David Carrera Barcelona Supercomputing Center Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments Jeffrey C. Carver University of Alabama Introduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) Software Engineering and Reuse in Computational Science and Engineering Software Engineers: Careers in Research The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) Dan Cassidy Los Alamos National Laboratory P47: Understanding Congestion on Omni-Path Fabrics Ralph Castain Intel Corporation Charting the PMIx Roadmap Vito Giovanni Castellana Pacific Northwest National Laboratory Introduction - IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms Charlie Catlett Director, Urban Center for Computation & Data Argonne National Laboratory HPC Connects Plenary: The Century of the City John Cavazos University of Delaware P76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for Kernels Carlo Cavazzoni CINECA State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) Aurelien Cavelan University of Basel Resilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance Analysis Cris Cecka Nvidia Corporation Low Communication FMM-Accelerated FFT on GPUs Batched, Reproducible, and Reduced Precision BLAS Milind Chabbi Independent Path-Synchronous Performance Monitoring in HPC Interconnection Networks with Source-Code Attribution Sourav Chakraborty Ohio State University Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design Bradford L. Chamberlain Cray Inc Introduction - PAW 2017: The 2nd Annual PGAS Applications Workshop PGAS Applications Workshop Panel Henry Chan Argonne National Laboratory Visualizing Silicene Growth Through Island Migration and Coalescence Sunita Chandrasekaran University of Delaware Introduction - Fourth Workshop on Accelerator Programming Using Directives (WACCPD) Introduction - Women in HPC: Diversifying the HPC Community OpenMP 4.5 Validation and Verification Suite An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT The OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained Impact Experiencing HPC for Undergraduates: Careers in HPC OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap Fourth Workshop on Accelerator Programming Using Directives (WACCPD) Choongseok Chang Princeton University Facing the Big Data Challenge in the Fusion Code XGC Kenneth Chang University of California, Santa Cruz CAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning P65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement Learning Barbara Chapman Stony Brook University Benchmarking and Evaluating Unified Memory for OpenMP GPU Offloading OpenMP Common Core: A “Hands-On” Exploration OpenSHMEM in the Era of Exascale Dylan Chapp University of Delaware A17: Toward Capturing Nondeterminism Motifs in HPC Applications Kyle Chard University of Chicago Probabilistic Guarantees of Execution Duration for Amazon Spot Instances Ryan Chard Argonne National Laboratory Probabilistic Guarantees of Execution Duration for Amazon Spot Instances Niladrish Chatterjee Nvidia Corporation Toward Standardized Near-Data Processing with Unrestricted Data Placement for GPUs Bhaskar Chaudhury Dhirubhai Ambani Institute of Information and Communication Technology P27: Parallelization of the Particle-In-Cell Monte Carlo Collision (PIC-MCC) Algorithm for Plasma Simulation on Intel MIC Xeon Phi Architecture Abhishek Chaurasia FWDNXT Inc Snowflake: Efficient Accelerator for Deep Neural Networks Shuai Che Advanced Micro Devices Inc Gravel: Fine-Grain GPU-Initiated Network Messages Bingwei Chen Tsinghua University 15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz Scenarios Changsheng Chen University of Massachusetts, Dartmouth Sun Yat-Sen University Visualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model System Cheng Chen Data Storage Institute National University of Singapore Transactional NVM Cache with High Performance and Crash Consistency Feng Chen University of Texas Advanced Manycore Programming (KNL) Hsing-bung Chen Los Alamos National Laboratory P55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage Reliability Jieyang Chen University of California, Riverside Correcting Soft Errors Online in Fast Fourier Transform Tong Chen IBM Implementing Implicit OpenMP Data Sharing on GPUs Xiaofei Chen Southern University of Science and Technology, China 15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz Scenarios Xinyu Chen University of New Mexico P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView Yen Chen Chen National Taiwan University A03: A High-Speed Algorithm for Genome-Wide Association Studies on Multi-GPU Systems Zizhong Chen University of California, Riverside Correcting Soft Errors Online in Fast Fourier Transform ParaStack: Efficient Hang Detection for MPI Programs at Large Scale Sai P. Chenna University of Florida A FPGA-Pipelined Approach for Accelerated Discrete-Event Simulation of HPC Systems Gopinath Chennupati Los Alamos National Laboratory A Scalable Analytical Memory Model for CPU Performance Prediction Mathew J. Cherukara Argonne National Laboratory Visualizing Silicene Growth Through Island Migration and Coalescence Naveen Cherukuri Intel Corporation Run-to-Run Variability on Xeon Phi Based Cray XC Systems Kazem Cheshmi Rutgers University Sympiler: Transforming Sparse Matrix Codes by Decoupling Symbolic Analysis Weng Cho Chew University of Illinois P16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering Solutions Andrew Chien University of Chicago Resilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance Analysis Bruce Childers University of Pittsburgh Reproducibility and Uncertainty in High Performance Computing Wendy K. Cho National Center for Supercomputing Applications, University of Illinois P33: Massively Parallel Evolutionary Computation for Empowering Electoral Reform: Quantifying Gerrymandering via Multi-objective Optimization and Statistical Analysis Jaemin Choi University of Illinois at Urbana-Champaign Migratable Objects and Task-Based Parallel Programming with Charm++ A21: Runtime Support for Concurrent Execution of Overdecomposed Heterogeneous Tasks Andrew Y. Choliy Rutgers University P12: Multi-Size Optional Offline Caching Algorithms Fred Chong University of Chicago Quantum Computing and Irregular Applications Jerry chou National Tsing Hua University, Taiwan Optimizing the Query Performance of Block Index Through Data Analysis and I/O Modeling Edmond Chow Georgia Institute of Technology Distributed Southwell: An Iterative Method with Low Communication Costs Invited Talks 3 Invited Talks 4 AJ Christensen National Center for Supercomputing Applications, University of Illinois Milky Way Analogue Isolated Disk Galaxy First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe Ching-Hsiang Chu Ohio State University A27: High-Performance and Scalable Broadcast Schemes for Deep Learning on GPU Clusters Pi-Yueh Chuang George Washington University An Example of Porting PETSc Applications to Heterogeneous Platforms with OpenACC Neil Chue Hong University of Edinburgh Introduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) Software Engineering and Reuse in Computational Science and Engineering Software Engineers: Careers in Research The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) Ryan Chui National Center for Supercomputing Applications, University of Illinois P38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data Management Sudheer Chunduri Argonne National Laboratory Run-to-Run Variability on Xeon Phi Based Cray XC Systems IHsin Chung IBM Towards a Composable Computer System Vladimir Chupakhin Janssen Global Services LLC P62: How To Do Machine Learning on Big Clusters Michael Chuvelev Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Vojtech Cima Technical University of Ostrava P62: How To Do Machine Learning on Big Clusters Florina M. Ciorba University of Basel P74: A Methodology for Bridging the Native and Simulated Executions of Parallel Applications Selim Ciraci Microsoft Introduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017) Raymond C. Clay III Sandia National Laboratories Embracing a New Era of Highly Efficient and Productive Quantum Monte Carlo Simulations David Clifton ANSYS Inc HPC Systems Professionals Workshop Thomas Clune NASA Goddard Space Flight Center pFlogger: The Parallel Fortran Logging Framework for HPC Applications Richard Coffey Argonne National Laboratory Fourth SC Workshop on Best Practices for HPC Training HPC Education: Meeting of the SIGHPC Education Chapter Paul Coffman Argonne National Laboratory Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Susan Coghlan Argonne National Laboratory Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Albert Cohen French Institute for Research in Computer Science and Automation (INRIA) Implementation of a Cache Miss Calculator in LLVM/Polly Gary Collins University of Tennessee Flexible Batched Sparse Matrix-Vector Product on GPUs Toni Collis Appentra Solutions, Women in High Performance Computing Embracing Diversity: the Benefits Panel Discussion: Diversifying the HPC workforce Introduction - Women in HPC: Diversifying the HPC Community Career Panel Discussion: Hints and Tips to Progress Your Career Workshop Outcomes and Closing Speed Networking Women in HPC: Non-Traditional Paths to HPC and How They Can and Do Enrich the Field Recruitment: How to Build Diverse Teams Women in HPC: Diversifying the HPC Workforce Guojing Cong IBM Accelerating Deep Neural Network Learning for Speech Recognition on a Cluster of GPUs Paul Constantine University of Colorado, Boulder Contemporary Design of Supercomputer Experiments Mike Conway Renaissance Computing Institute Virtualization Ecosystems – Supporting Increasingly Complex Scientific Applications Steve Conway Hyperion Research A Taxonomy of HPDA Algorithms Blurring the Lines: High-End Computing and Data Science Jeanine Cook Sandia National Laboratories Time Management Jonathan Cook New Mexico State University P73: HPC Production Job Quality Assessment James Coomer DataDirect Networks Best Practices for Architecting Performance and Capacity in the Burst Buffer Era Burst Buffers: Flash in the Pan? Marcin Copik RWTH Aachen University A05: Parallel Prefix Algorithms for the Registration of Arbitrarily Long Electron Micrograph Series Thomas Corcoran Lawrence Berkeley National Laboratory P36: A Novel Feature-Preserving Spatial Mapping for Deep Learning Classification of Ras Structures Anthony Costa Icahn School of Medicine at Mount Sinai Medical Image Analysis and Visualization Timothy B. Costa Intel Corporation Designing Vector-Friendly Compact BLAS and LAPACK Kernels Batched, Reproducible, and Reduced Precision BLAS Jim Cownie Intel Corporation LLVM in HPC: Uses and Desires OpenMP® is Twenty. Where Is It Going? David Cox Harvard University Input-Aware Auto-Tuning of Compute-Bound HPC Kernels Donna J. Cox National Center for Supercomputing Applications, University of Illinois Milky Way Analogue Isolated Disk Galaxy First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe Silvia Crivelli Lawrence Berkeley National Laboratory P36: A Novel Feature-Preserving Spatial Mapping for Deep Learning Classification of Ras Structures Peter D. Crossman Los Alamos National Laboratory P07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC Architectures Carlos A. Cruz NASA Goddard Space Flight Center pFlogger: The Parallel Fortran Logging Framework for HPC Applications Xuewen Cui Virginia Tech P82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining vs. NVIDIA’s Unified Memory Massimiliano Culpo Swiss Federal Institute of Technology in Lausanne Managing HPC Software Complexity with Spack Eugenio Culurciello FWDNXT Inc Snowflake: Efficient Accelerator for Deep Neural Networks Matthew L. Curry Sandia National Laboratories P45: Campaign Storage: Erasure Coding with GPUs Tony Curtis Stony Brook University OpenSHMEM in the Era of Exascale Return to Top D John D'Ambrosia Ethernet Alliance Huawei The Ethernet Portfolio for HPC Nicholas D'Imperio Brookhaven National Laboratory P34: GPU Acceleration for the Impurity Solver in GW+DMFT Packages Michael D'mello Intel Corporation P30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon Phi Felipe H. da Jornada University of California, Berkeley P13: Large-Scale GW Calculations on Pre-Exascale HPC Systems Tamara Dahlgren Lawrence Livermore National Laboratory P94: Fully Hierarchical Scheduling: Paving the Way to Exascale Workloads Christopher S. Daley Lawrence Berkeley National Laboratory Performance and Energy Usage of Workloads on KNL and Haswell Architectures Patricia Damkroger Intel Corporation Introduction - Women in HPC: Diversifying the HPC Community Early Career Coaching Anthony Danalis University of Tennessee P72: New Developments for PAPI 5.6+ Tharun Kumar Dangeti International Institute of Information Technology, Hyderabad Improved Loop Distribution in LLVM Using Polyhedral Dependences Anwesha Das North Carolina State University P89: Desh: Deep Learning for HPC System Health Resilience Arnab Das University of Utah P84: PRESAGE: Selective Low Overhead Error Amplification for Easy Detection Santanu Das International Institute of Information Technology, Hyderabad Improved Loop Distribution in LLVM Using Polyhedral Dependences Christos Davatzikos University of Pennsylvania A Framework for Scalable Biophysics-Based Image Analysis James Davis University of Warwick An Efficient Task-Based All-Reduce for Machine Learning Applications Miyuru Dayarathna WSO2 Inc Multiple Stream Job Performance Optimization with Source Operator Graph Transformations Andreas de Blanche University West Sweden Tetra Pak P44: Increasing Throughput of Multiprogram HPC Workloads: Evaluating a SMT Co-Scheduling Approach Raphael Y. de Camargo Federal University of ABC, Santo André, Brazil Obtaining Dynamic Scheduling Policies with Simulation and Machine Learning Cees de Laat University of Amsterdam Innovating the Network for Data Intensive Science (INDIS) Gustavo De Leon Los Alamos National Laboratory University of California, Berkeley P54: Investigating Hardware Offloading for Reed-Solomon Encoding Daniel Oliveira Fluminense Federal University Toward Preserving Results Confidentiality in Cloud-Based Scientific Workflows Daniele De Sensi University of Pisa Nornir: A Power-Aware Runtime Support for Parallel Applications Bronis R. de Supinski Lawrence Livermore National Laboratory Advanced OpenMP: Performance and 4.5 Features Mastering Tasking with OpenMP P82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining vs. NVIDIA’s Unified Memory Cutting Edge File Systems Tom Deakin University of Bristol P69: Portable Methods for Measuring Cache Hierarchy Performance Diptorup Deb University of North Carolina QUARC: An Optimized DSL Framework Using LLVM Nathan Debardeleben Los Alamos National Laboratory Experimental and Analytical Study of Xeon Phi Reliability P92: Characterization and Comparison of Application Resilience for Serial and Parallel Executions Ewa Deelman Information Sciences Institute, University of Southern California rvGAHP – Push-Based Job Submission Using Reverse SSH Connections Mauro Del Ben Lawrence Berkeley National Laboratory P13: Large-Scale GW Calculations on Pre-Exascale HPC Systems Robert DeLeon University at Buffalo Tracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRT Robert L. Deleon University at Buffalo A Slurm Simulator: Implementation and Parametric Analysis Phil Demar Fermi National Laboratory P43: Deep Packet/Flow Analysis Using GPUs David E. DeMarle Kitware Inc Large Scale Visualization with ParaView James Demmel University of California, Berkeley Scaling Deep Learning on GPU and Knights Landing Clusters Linear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and Accelerators Paul Demorest National Radio Astronomy Observatory realfast@VLA Nicolas Denoyelle French Institute for Research in Computer Science and Automation (INRIA) Modeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline Model John W. Dermer Los Alamos National Laboratory P54: Investigating Hardware Offloading for Reed-Solomon Encoding Jack Deslippe Lawrence Berkeley National Laboratory Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Hariharan Devarajan Illinois Institute of Technology Evaluating GPGPU Memory Performance Through the C-AMAT Model Open Ethernet Drive: Evolution of Energy-Efficient Storage Technology Mehmet Deveci Sandia National Laboratories Designing Vector-Friendly Compact BLAS and LAPACK Kernels Mike Dewar Numerical Algorithms Group HPC Software: Is “Cool Stuff” Really Incompatible with Sustainability? Salvatore Di Girolamo ETH Zurich sPIN: High-Performance Streaming Processing in the Network A08: Virtualized Big Data: Reproducing Simulation Output on Demand Sheng Di Argonne National Laboratory An Efficient Approach to Lossy Compression with Pointwise Relative Error Bound P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry Lori Diachin Lawrence Livermore National Laboratory Using HPC to Impact US Manufacturing through the HPC4Mfg Program Gerrett Diamond Rensselaer Polytechnic Institute Dynamic Load Balancing of Massively Parallel Unstructured Meshes Philip Diamond Square Kilometre Array Life, the Universe and Computing: The Story of the SKA Telescope Mattias Diener University of Illinois Visualizing, Measuring, and Tuning Adaptive MPI Parameters Integrating OpenMP into the Charm++ Programming Model Mark Dietrich Compute Canada Supercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and Organizations Americas HPC Collaboration Gary A. Dilts Los Alamos National Laboratory P07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC Architectures Nan Ding Tsinghua University Redesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLight Minh Dinh University of Queensland Five-minute presentations by young researchers from around the world - part 1 Sebastian Doebel Technical University Dresden An LLVM Instrumentation Plug-In for Score-P Douglas Doerfler Lawrence Berkeley National Laboratory Performance and Energy Usage of Workloads on KNL and Haswell Architectures Usability, Scalability and Productivity on Many-Core Processors: Intel Xeon Phi Jiri Dokulil University of Vienna Extending the Open Community Runtime with External Application Support David Domyancic Lawrence Livermore National Laboratory P94: Fully Hierarchical Scheduling: Paving the Way to Exascale Workloads Bin Dong Lawrence Berkeley National Laboratory Optimizing the Query Performance of Block Index Through Data Analysis and I/O Modeling Wenjie Dong Sun Yat-Sen University Visualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model System Jack Dongarra University of Tennessee Introduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems Investigating Half-Precision Arithmetic to Accelerate Dense Linear System Solvers Flexible Batched Sparse Matrix-Vector Product on GPUs Dynamic Task Discovery in PaRSEC- A Data-Flow Task-Based Runtime Keynote - An Overview of High Performance Computing and Challenges for the Future TOP500 - Past, Present, Future Linear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and Accelerators Big Data and Exascale Computing (BDEC) Community Report TOP500 Supercomputers Batched, Reproducible, and Reduced Precision BLAS P72: New Developments for PAPI 5.6+ David Donofrio Lawrence Berkeley National Laboratory Workshop for Open Source Supercomputing PARADISE: A ToolFlow to Model Emerging Technologies for the Post-CMOS Era in HPC Reconfigurable Computing in Exascale Rion Dooley University of Texas HPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave API Matthieu Dorier Argonne National Laboratory Supporting Task-level Fault-Tolerance in HPC Workflows by Launching MPI Jobs inside MPI Jobs Matthieu Dreher Argonne National Laboratory In Situ Workflows at Exascale: System Software to the Rescue Nikoli Dryden University of Illinois Lawrence Livermore National Laboratory Toward Scalable Parallel Training of Deep Neural Networks David H.C. Du University of Minnesota P56: ZoneTier: A Zone-Based Storage Tiering and Caching Co-Design to Integrate SSDs with Host-Aware SMR Drives Xiaohui Duan Shandong University Redesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLight Nicolas Dube Hewlett Packard PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and Control Anshu Dubey Argonne National Laboratory University of Chicago Proposal for a Scientific Software Lifecycle Model Better Scientific Software Multiphysics Pradeep Dubey Intel Corporation Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Artificial Intelligence and The Virtuous Cycle of Compute Nicolas Dubé Hewlett Packard Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Dmitry Duplyakin University of Utah Contemporary Design of Supercomputer Experiments Earl Duque Intelligent Light Introduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization HPC Powers Wind Energy ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization Thomas Durbin Durbin Engineering Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Dmitry Durnov Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Matthew Dwyer University of Nebraska Towards Self-Verification in Finite Difference Code Generation Michael D’mello Intel Corporation An Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi Processor Return to Top E Jonathan Eastep Intel Corporation PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and Control P95: GEOPM: A Scalable Open Runtime Framework for Power Management Joe Eaton Nvidia Corporation Parallel Jaccard and Related Graph Clustering Techniques Jerry Ebalunode University of Houston Vistas in Advanced Computing H. Carter Edwards Sandia National Laboratories Kokkos: Enabling Manycore Performance Portability for C++ Applications and Domain Specific Libraries/Languages Stratos Efstathiadis New York University Second Annual Meeting of the SIGHPC - Big Data Chapter Alexandre Eichenberger IBM Implementing Implicit OpenMP Data Sharing on GPUs Stephan Eidenbenz Los Alamos National Laboratory A Scalable Analytical Memory Model for CPU Performance Prediction Victor Eijkhout University of Texas Advanced Manycore Programming (KNL) Greg Eisenhauer Georgia Institute of Technology Parallel Streaming for In Transit Analysis with Heterogeneous Data Layout Daniel Eisenstein Harvard University Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Mohamed El-Hadedy University of Illinois RE-HASE: Regular-Expressions Hardware Synthesis Engine Izzat El-Hajj University of Illinois P16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering Solutions Nosayba El-Sayed Massachusetts Institute of Technology Qatar Computing Research Institute Understanding Object-Level Memory Access Patterns Across the Spectrum Ahmed Eleliemy University of Basel P74: A Methodology for Bridging the Native and Simulated Executions of Parallel Applications Sally Ellingson University of Kentucky Deep Learning Michael J. Ellsworth, Jr. IBM Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Nahid Emad University of Versailles Maison de la Simulation Parallel Jaccard and Related Graph Clustering Techniques Runtime Correctness Checking for Emerging Programming Paradigms Joel Emer Nvidia Corporation Understanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and Applications Toshio Endo Tokyo Institute of Technology TSUBAME3.0: A Green, Accelerated, Big-Data Supercomputer Applying Temporal Blocking with a Directive-Based Approach State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) P05: ooc_cuDNN : A Deep Learning Library Supporting CNNs over GPU Memory Capacity Christian Engelmann Oak Ridge National Laboratory Introduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems Failures in Large Scale Systems: Long-Term Measurement, Analysis, and Implications Characterizing Faults, Errors, and Failures in Extreme-Scale Systems Nicolás Erdödy Open Parallel Ltd SKA: The Ultimate Big Data Project The Internet of Things and HPC: Are They Teaming Up to Work Together? Mattan Erez University of Texas Silent Errors in HPC Systems Rajeev S. Erramilli Los Alamos National Laboratory P07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC Architectures Oscar Esquivel-Flores Monterrey Institute of Technology Invited Talk - On Improved Monte Carlo Hybrid Methods for Preconditioner Computations Trilce Estrada University of New Mexico Panel: Attracting Women and Underrepresented Minorities to HPC and Data Science Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience Comments P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView Jean-Matthieu ETANCELIN University of Reims Champagne-Ardenne P64: romeoLAB : HPC Training Platform on HPC facility Return to Top F Peyman Faizian Florida State University Modeling UGAL on the Dragonfly Topology A Comparative Study of SDN and Adaptive Routing on Dragonfly Networks Alessandro Fanfarillo National Center for Atmospheric Research Performance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray Fortran Aiman Fang University of Chicago Resilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance Analysis Jian Fang Delft University of Technology Adopting OpenCAPI for High Bandwidth Database Accelerators Massimiliano Fatica Nvidia Corporation A Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU Systems Farzad Fatollahi-Fard Lawrence Berkeley National Laboratory Workshop for Open Source Supercomputing Christian Feld Juelich Supercomputing Center Hands-On Practical Hybrid Parallel Application Performance Engineering Alexandre Fender Nvidia Corporation University of Versailles Parallel Jaccard and Related Graph Clustering Techniques Wu Feng Virginia Tech The Green500: Trends in Energy-Efficient Supercomputing Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) P82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining vs. NVIDIA’s Unified Memory John Feo Pacific Northwest National Laboratory Introduction - IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms Charles R. Ferenbaugh Los Alamos National Laboratory P07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC Architectures Mark Fernandez Hewlett Packard Enterprise HPC in Space: Supercomputing at 17,500 MPH Rafael Ferreira da Silva University of Southern California On the Use of Burst Buffers for Accelerating Data-Intensive Scientific Workflows Kurt B. Ferreira Sandia National Laboratories P93: Spacehog: Evaluating the Costs of Dedicating Resources to In Situ Analysis Nicola Ferrier Argonne National Laboratory Introduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization Daniel A. Feshbach Haverford College A20: Correctness Verification and Boundary Conditions for Chapel Iterator-Based Loop Optimization Adam Fidel Texas A&M University Bounded Asynchrony and Nested Parallelism for Scalable Graph Processing Steve Fields IBM OpenCAPI: High Performance, Host-Agnostic, Coherent Accelerator Interface Weronika Filinger University of Edinburgh From Outreach to Education to Researcher - Innovative Ways of Expanding the HPC Community Salvatore Filippone Cranfield University Introduction - PAW 2017: The 2nd Annual PGAS Applications Workshop Hal Finkel Argonne National Laboratory FPGAs for Supercomputing? Progress and Challenges Introduction - LLVM-HPC2017: Fourth Workshop on the LLVM Compiler Infrastructure in HPC Developing an OpenMP Runtime for UVM-Capable GPUs Benchmarking and Evaluating Unified Memory for OpenMP GPU Offloading OpenMP 4.5 Validation and Verification Suite Concluding Remarks – LLVM-HPC2017 Distributed and Heterogeneous Programming in C++ for HPC LLVM in HPC: Uses and Desires LLVM-HPC2017: Fourth Workshop on the LLVM Compiler Infrastructure in HPC Paul Fischer University of Illinois Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Goran Flegar Jaume I University Overcoming Load Imbalance for Irregular Sparse Matrices Flexible Batched Sparse Matrix-Vector Product on GPUs Kermin Fleming Intel Corporation LESS: Loop Nest Execution Strategies for Spatial Architectures Fernanda Foertter Oak Ridge National Laboratory Fourth SC Workshop on Best Practices for HPC Training Career Panel Discussion: Hints and Tips to Progress Your Career Overcoming the Confidence Gap Parallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACC The OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained Impact An Example of Porting PETSc Applications to Heterogeneous Platforms with OpenACC Kokkos: Enabling Manycore Performance Portability for C++ Applications and Domain Specific Libraries/Languages OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap HPC Education: Meeting of the SIGHPC Education Chapter Interactivity in Supercomputing Mike Folk HDF Group Software Engineering and Reuse in Computational Science and Engineering John Fonner Texas Advanced Computing Center, University of Texas HPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave API John C. Forbes Harvard University Harvard-Smithsonian Center for Astrophysics Milky Way Analogue Isolated Disk Galaxy Andrea Formisano University of Perugia Accelerating Energy Games Solvers on Modern Architectures Ian Foster Argonne National Laboratory Introduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2) Cloud Computing for Science and Engineering The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2) Pouya Fotouhi University of California, Davis P50: Energy-Efficient and Scalable Bio-Inspired Nanophotonic Computing P49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect Architectures Yvan Fournier EDF France Melissa: Large Scale In Transit Global Sensitivity Analysis Avoiding Intermediate Files Robert J. Fowler University of North Carolina QUARC: An Optimized DSL Framework Using LLVM William Fox Georgia Institute of Technology University of California, San Francisco E-HPC: A Library for Elastic Resource Management in HPC Environments Franz Franchetti Carnegie Mellon University P06: Large Scale FFT-Based Stress-Strain Simulations with Irregular Domain Decomposition IA^3 Debate Tommy Franczak Northern Illinois University A Path from Serial Execution to Hybrid Parallelization for Learning HPC Robert Freeman Jr Harvard University HPC Carpentry - Practical, Hands-On HPC Training Bernhard Friebe Intel Corporation Enabling FPGAs for the Software Developers Brian Friesen National Energy Research Scientific Computing Center Performance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray Fortran Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies Haohan Fu Tsinghua University National Supercomputing Center, Wuxi Redesigning CAM-SE for Peta-Flops Performance on Sunway TaihuLight Lessons on Integrating and Utilizing 10 Million Cores: Experience of Sunway TaihuLight Redesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLight 15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz Scenarios Song Fu University of North Texas P55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage Reliability Akihiro Fujii Kogakuin University P14: Robust SA-AMG Solver by Extraction of Near-Kernel Vectors Katsuki Fujisawa Kyushu University National Institute of Advanced Industrial Science and Technology Cyber-Physical System and Industrial Applications of Large-Scale Graph Analysis and Optimization Problems P78: Performance Evaluation of Graph500 Considering CPU-DRAM Power Shifting Hajime Fujita Intel Corporation Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 Kohei Fujita University of Tokyo RIKEN Implicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACC P09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element Simulations P23: AI with Super-Computed Data for Monte Carlo Earthquake Hazard Classification Douglas Fuller Red Hat Inc Ceph Applications in HPC Environments Student/Postdoc Job Fair Thomas R. Furlani University at Buffalo A Slurm Simulator: Implementation and Parametric Analysis Tracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRT Mikito Furuichi Japan Agency for Marine-Earth Science and Technology P21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion Particles Yasunori Futamura University of Tsukuba Efficient and Scalable Calculation of Complex Band Structure Using Sakurai-Sugiura Method Return to Top G Abhinav Gaba Intel Corporation LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization Alice-Agnes Gabriel Ludwig Maximilian University of Munich Extreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust Earthquake Niall Gaffney University of Texas Virtualization Ecosystems – Supporting Increasingly Complex Scientific Applications Ana Gainaru Vanderbilt University Periodic I/O Scheduling for Supercomputers Kelly Gaither University of Texas Panel Discussion: Diversifying the HPC workforce Introduction - Women in HPC: Diversifying the HPC Community Career Panel Discussion: Hints and Tips to Progress Your Career Hints and Tips for Public Speaking High Performance Computing Education in US Data Science Scientific Visualization & Data Analytics Showcase James Galarowicz Krell Institute How To Analyze the Performance of Parallel Codes 101 Steven M. Gallo University at Buffalo A Slurm Simulator: Implementation and Parametric Analysis Todd Gamblin Lawrence Livermore National Laboratory 4th International Workshop on HPC User Support Tools (HUST-17) Projecting Performance Data Over Simulation Geometry Using SOSflow and Alpine ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data Performance Modeling under Resource Constraints Using Deep Transfer Learning Predicting the Performance Impact of Different Fat-Tree Configurations Managing HPC Software Complexity with Spack P75: Model-Agnostic Influence Analysis for Performance Data Lin Gan Tsinghua University Redesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLight Aryya Gangopadhyay University of Maryland, Baltimore County Multidisciplinary Education on Big Data + HPC + Atmospheric Sciences Sangram Ganguly NASA Ames Research Center Common Big Data Challenges in Bio, Geo, Climate, and Social Sciences Dennis Gannon Indiana University Cloud Computing for Science and Engineering Guang Gao University of Delaware Verification of the Extended Roofline Model for Asynchronous Many Task Runtimes P99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big Data Tao Gao University of Delaware A23: Evaluation of Data-Intensive Applications on Intel Knights Landing Cluster Eric Garcia Intel Corporation LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization Rao V. Garimella Los Alamos National Laboratory P07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC Architectures Michael Garland Nvidia Corporation Parallel Depth-First Search for Directed Acyclic Graphs Nitin A. Gawande Pacific Northwest National Laboratory Evaluating On-Node GPU Interconnects for Deep Learning Workloads Markus Geimer Juelich Supercomputing Center Hands-On Practical Hybrid Parallel Application Performance Engineering Al Geist Oak Ridge National Laboratory Introduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems Brad Geltz Intel Corporation P95: GEOPM: A Scalable Open Runtime Framework for Power Management Ann gentile Sandia National Laboratories HPC Systems Monitoring Data in Action Raffaella Gentilini University of Perugia Accelerating Energy Games Solvers on Modern Architectures Giorgis Georgakoudis Queen's University Belfast REFINE: Realistic Fault Injection via Compiler-Based Instrumentation for Accuracy, Portability and Speed Evangelos Georganas Intel Corporation P31: Understanding the Performance of Small Convolution Operations for CNN on Intel Architecture Alan George University of Pittsburgh Reconfigurable Supercomputing (RSC) Daniel George National Center for Supercomputing Applications, University of Illinois A13: Deep Learning with HPC Simulations for Extracting Hidden Signals: Detecting Gravitational Waves Richard Gerber Lawrence Berkeley National Laboratory Fourth SC Workshop on Best Practices for HPC Training Lisa Gerhardt Lawrence Berkeley National Laboratory Container Computing for HPC and Scientific Workflows Sandra Gesing University of Notre Dame Introduction - WORKS 2017 (12th Workshop on Workflows in Support of Large-Scale Science) Berk Geveci Kitware Inc In Situ Summarization with VTK-m Sheikh K. Ghafoor Tennessee Technological University Introduction - Workshop on Education for High Performance Computing (EduHPC) Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience Comments Amir Gholami University of Texas A Framework for Scalable Biophysics-Based Image Analysis Experiencing HPC for Undergraduates: Graduate Student Perspective Devarshi Ghoshal Lawrence Berkeley National Laboratory E-HPC: A Library for Elastic Resource Management in HPC Environments Paolo Giannozzi University of Udine A Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU Systems Paul Gibbon Forschungszentrum Juelich P87: EoCoE Performance Benchmarking Methodology for Renewable Energy Applications Mike Giles University of Oxford Beyond 16GB: Out-of-Core Stencil Computations P01: Cache-Blocking Tiling of Large Stencil Codes at Runtime Lauren Gillespie Southwestern University P47: Understanding Congestion on Omni-Path Fabrics Ladina Gilly Swiss National Supercomputing Centre Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Alfredo Gimenez Lawrence Livermore National Laboratory University of California, Davis ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data Projecting Performance Data Over Simulation Geometry Using SOSflow and Alpine Judit Gimenez Barcelona Supercomputing Center Introduction - 4th International Workshop on Visual Performance Analytics – VPA 2017 Fourth International Workshop on Visual Performance Analysis – VPA 2017 Benjamin H. Glick Lewis & Clark College A07: Scalable Parallel Scripting in the Cloud Matthias Gobbert University of Maryland, Baltimore County Multidisciplinary Education on Big Data + HPC + Atmospheric Sciences Jens Henrik Goebbert Forschungszentrum Juelich Comprehensive Visualization of Large-Scale Simulation Data Linked to Respiratory Flow Computations on HPC Systems Brice Goglin French Institute for Research in Computer Science and Automation (INRIA) Modeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline Model Cross-Layer Allocation and Management of Hardware Resources in Shared Memory Nodes Eng Lim Goh Hewlett Packard Enterprise HPC in Space: Supercomputing at 17,500 MPH Ali Murat Gok Argonne National Laboratory Northwestern University P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry Nathan J. Goldbaum National Center for Supercomputing Applications, University of Illinois Milky Way Analogue Isolated Disk Galaxy Deb Goldfarb Intel Corporation Negotiation Skills Career Panel Discussion: Hints and Tips to Progress Your Career Sally Goldman Google Panel: Attracting Women and Underrepresented Minorities to HPC and Data Science Antonio Tedu A. Gomes National Laboratory for Scientific Computing, Brazil Supercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and Organizations Canstantino Gomez Barcelona Supercomputing Center Five-minute presentations by young researchers from around the world - part 2 Rosalia Gomez Texas Advanced Computing Center, University of Texas High Performance Computing Education in US Data Science Qian Gong Fermi National Laboratory P43: Deep Packet/Flow Analysis Using GPUs Yifan Gong TuSimple Efficient Process Mapping in Geo-Distributed Cloud Data Centers Elsa Gonsiorowski Lawrence Livermore National Laboratory Career Panel Discussion: Hints and Tips to Progress Your Career How to Take the Next Step in Your Career Ganesh Gopalakrishnan University of Utah P84: PRESAGE: Selective Low Overhead Error Amplification for Easy Detection Mark Gordon Iowa State University An Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi Processor P32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDs Mark S. Gordon Iowa State University Porting a GAMESS Computational Chemistry Kernel to FPGAs P30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon Phi Steven Gordon Ohio Supercomputer Center A Proposed Model for Teaching Advanced Parallel Computing and Related Topics HPC Education: Meeting of the SIGHPC Education Chapter Sergei Gorlatch University of Munster PACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming Model Gerard Gorman Imperial College, London Towards Self-Verification in Finite Difference Code Generation Software Engineering and Reuse in Computational Science and Engineering R. Govindarajan Indian Institute of Science HPC Initiatives in India Paolo Grani University of California, Davis P49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect Architectures David Grant Oak Ridge National Laboratory Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Ryan Grant Sandia National Laboratories Workshop on Exascale MPI (ExaMPI) sPIN: High-Performance Streaming Processing in the Network PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and Control Catherine Graves Hewlett Packard Computing with Physics: Analog Computation and Neural Network Classification with a Dot Product Engine Jennifer Green Los Alamos National Laboratory How To Analyze the Performance of Parallel Codes 101 Kevin Griffin Lawrence Livermore National Laboratory Scalable HPC Visualization and Data Analysis Using VisIt Leopold Grinberg IBM P79: Porting the Opacity Client Library to a CPU-GPU Cluster Using OpenMP 4.5 William Gropp University of Illinois Challenges in Programming Extreme Scale Systems Energy Efficiency Gains From Software: Retrospectives and Perspectives Advanced MPI Programming Software Engineering and Reuse in Computational Science and Engineering P70: FFT, FMM, and Multigrid on the Road to Exascale: Performance Challenges and Opportunities Tobias Grosser ETH Zurich Improved Loop Distribution in LLVM Using Polyhedral Dependences Max Grossman Rice University Graph500 on OpenSHMEM: Using a Practical Survey of Past Work to Motivate Novel Algorithmic Developments Chapel-on-X: Exploring Tasking Runtimes for PGAS Languages Robert Grossman University of Chicago Blurring the Lines: High-End Computing and Data Science Sharing Research Data: Data Commons, Distributed Clouds, and Distributed Data Services Paola Grosso University of Amsterdam Innovating the Network for Data Intensive Science (INDIS) Kenny Gruchalla National Renewable Energy Laboratory Contextual Compression of Large-Scale Wind Turbine Array Simulations Hui Guan North Carolina State University Egeria: A Framework for Auto-Construction of HPC Advising Tools through Multi-Layered Natural Language Processing Qiang Guan Los Alamos National Laboratory Ultrascale Systems Research Center P92: Characterization and Comparison of Application Resilience for Serial and Parallel Executions P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView Ernesto Guerrero University of Malaga Parallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACC Shashank Gugnani Ohio State University A06: Accelerating Big Data Processing in the Cloud with Scalable Communication and I/O Schemes Pablo Guillen University of Houston Vistas in Advanced Computing Raghul Gunasekaran Oak Ridge National Laboratory Scientific User Behavior and Data-Sharing Trends in a Petascale File System GUIDE: A Scalable Information Directory Service to Collect, Federate, and Analyze Logs for Operational Insights into a Leadership HPC Facility Murat E. Guney Intel Corporation Designing Vector-Friendly Compact BLAS and LAPACK Kernels Peng Guo Chinese Academy of Sciences Chinese Academy of Sciences Tessellating Stencils Xinfei Guo University of Virginia RE-HASE: Regular-Expressions Hardware Synthesis Engine Xuan Guo Oak Ridge National Laboratory Introduction - The Eighth International Workshop on Data-Intensive Computing in the Clouds Yanfei Guo Argonne National Laboratory Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1 MPICH: A High-Performance Open-Source MPI Implementation Anshul Gupta IBM Introduction - Workshop on Education for High Performance Computing (EduHPC) Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience Comments Rajiv Gupta University of California, Riverside ParaStack: Efficient Hang Detection for MPI Programs at Large Scale Ravi Gupta Intel Corporation Snowpack: Efficient Parameter Choice for GPU Kernels via Static Analysis and Statistical Prediction Saurabh Gupta Intel Corporation Failures in Large Scale Systems: Long-Term Measurement, Analysis, and Implications BlazingText: Scaling and Accelerating Word2Vec using Multiple GPUs Levent Gurel University of Illinois P16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering Solutions John L. Gustafson National University of Singapore Posit Research Posit Math Unit (PMU) – A New Approach Toward Exascale Computing Improving Numerical Computation with Practical Tools and Novel Computer Arithmetic Ethan Gutmann National Center for Atmospheric Research Performance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray Fortran Markus Götz Research Center Juelich Supporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework Return to Top H Roland Haas National Center for Supercomputing Applications, University of Illinois P38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data Management Sonja Habbinga Forschungszentrum Juelich Comprehensive Visualization of Large-Scale Simulation Data Linked to Respiratory Flow Computations on HPC Systems Salman Habib Argonne National Laboratory Cosmological Particle Data Compression in Practice Elie Hachem Mines ParisTech Supercomputing for Everyone: Meeting the Growing Needs of Businesses Sebastian Hack Saarland University PACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming Model Daniel Hackenberg Technical University Dresden Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Towards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core Architectures Walker Haddock University of Alabama, Birmingham P45: Campaign Storage: Erasure Coding with GPUs Bilel Hadri King Abdullah University of Science and Technology Fourth SC Workshop on Best Practices for HPC Training Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG) Regression Testing and Monitoring Tools State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM) P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers Matthieu Haefele French Alternative Energies and Atomic Energy Commission P87: EoCoE Performance Benchmarking Methodology for Renewable Energy Applications Raphael Tuvia Haftka University of Florida Multi-Fidelity Surrogate Modeling for Application/Architecture Co-Design Hans Hagen University of Kaiserslautern Cosmological Particle Data Compression in Practice Georg Hager University of Erlangen-Nuremberg Node-Level Performance Engineering Christoph Hagleitner IBM Application Porting and Optimization on GPU-Accelerated POWER Architectures Gabriel Hahn Baylor University P35: Using HPC to Model Quantum-Dot Cellular Automata Azzam Haidar University of Tennessee Investigating Half-Precision Arithmetic to Accelerate Dense Linear System Solvers Batched, Reproducible, and Reduced Precision BLAS Michael Haidl University of Munster PACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming Model Mahantesh Halappanavar Pacific Northwest National Laboratory HPC Graph Toolkits and the GraphBLAS Forum Mary Hall