Presenter Index

Mark Duffield, Kees Vissers, Oliver Gunasekara
  Amazon Web Services, Xilinx Inc, NGCodec
    Keynote: FPGAs in AWS and First Use Cases (joint talk by AWS, NGcodec, and Xilinx)

A

Omar Aaziz
  New Mexico State University
    P73: HPC Production Job Quality Assessment
Moustafa AbdelBaky
  Rutgers University
    Submarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced Cyberinfrastructure
David Abdurachmanov
  CERN
    The ARM Software Ecosystem: Are We There Yet?
David Abramson
  University of Queensland
    Scalable Distributed Infrastructure for Data Intensive Science
Bilge Acun
  University of Illinois
    Mitigating Variability in HPC Systems and Applications for Performance and Power Efficiency
Ross N. Adelman
  US Army Research Laboratory
    P39: Extremely Large, Wide-Area Power-Line Models
Ferrol Aderholdt
  Oak Ridge National Laboratory
    P59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing Environments
Vikram Adve
  University of Illinois
    Developing an OpenMP Runtime for UVM-Capable GPUs
    Heterogeneous Parallel Virtual Machine and Parallelism in LLVM
Ilya Afanasyev
  Lomonosov Moscow State University
    Five-minute presentations by young researchers from around the world - part 2
Hoda Aghaei Khouzani
  University of Delaware
    Runtime Solutions to Apply Non-Volatile Memories in Future Computer Systems
Danial Aghajarian
  Georgia State University
    A Heterogeneous HPC Platform for Ill-Structured Spatial Join Processing
Abhinav Agrawal
  North Carolina State University
    Leveraging Near Data Processing for High-Performance Checkpoint/Restart
Kunal Agrawal
  Washington University in St. Louis
    Keynote: Teaching Sound Principles and Good Practices for Parallel Algorithms.
Khalid Ahmad
  University of Utah
    Automatic Testing of OpenACC Applications
Dong H. Ahn
  Lawrence Livermore National Laboratory
    P94: Fully Hierarchical Scheduling: Paving the Way to Exascale Workloads
Behzad R. Ahrabi
  University of Wyoming
    P28: High-Fidelity Blade-Resolved Wind Plant Modeling
James Ahrens
  Los Alamos National Laboratory
    Cosmological Particle Data Compression in Practice
    P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView
Alex Aiken
  Stanford University
    Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions
    The Legion Programming Model
Mark Ainsworth
  Brown University
    Introduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)
    MGARD: A Multilevel Technique for Compression of Floating-Point Data
    The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)
Asma H. Al-rawi
  Intel Corporation
    P95: GEOPM: A Scalable Open Runtime Framework for Power Management
Sadaf Alam
  Swiss National Supercomputing Centre
    How Serious Are We About the Convergence Between HPC and Big Data?
    Best Practices for Architecting Performance and Capacity in the Burst Buffer Era
    Interactivity in Supercomputing
Jay Alameda
  University of Illinois
    Fourth SC Workshop on Best Practices for HPC Training
Carl Albing
  US Naval Academy
    Fourth SC Workshop on Best Practices for HPC Training
Ben Albrecht
  Cray Inc
    Cosmological Particle-Mesh Simulations in Chapel
Nia Alexandrov
  Hartree Centre
    Fourth SC Workshop on Best Practices for HPC Training
Vassil Alexandrov
  Barcelona Supercomputing Center
    Fourth SC Workshop on Best Practices for HPC Training
    Introduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
    Invited Talk - On Improved Monte Carlo Hybrid Methods for Preconditioner Computations
    8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
Yuri Alexeev
  Argonne National Laboratory
    An Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi Processor
    P42: TRIP: An Ultra-Low Latency, TeraOps/s Reconfigurable Inference Processor for Multi-Layer Perceptrons
    P30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon Phi
    P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry
Momme Allalen
  Leibniz Supercomputing Centre
    P08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.II
Graham Allan
  University of Minnesota
    P60: Managing dbGaP Data with Stratus, a Research Cloud for Protected Data
William Allcock
  Argonne National Laboratory
    P32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDs
Randy Allen
  Mentor Graphics
    The Challenges Faced by OpenACC Compilers
Tyler Allen
  Clemson University
    Performance and Energy Usage of Workloads on KNL and Haswell Architectures
Amani Alonazi
  King Abdullah University of Science and Technology
    Five-minute presentations by young researchers from around the world - part 2
Ilkay Altintas
  San Diego Supercomputer Center
    A Machine Learning Approach for Modular Workflow Performance Prediction
Alper Altuntas
  National Center for Atmospheric Research
    Verifying Concurrency in an Adaptive Ocean Circulation Model
Rommie Amaro
  University of California, San Diego
    Molecular Simulation at the Mesoscale
Marcos Amarís
  University of Sao Paulo
    Performance Prediction Modeling of GPU Applications
Abdelhalim Amer
  Argonne National Laboratory
    Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1
    MPICH: A High-Performance Open-Source MPI Implementation
Sadika Amreen
  University of Tennessee
    Position Paper: Experiences on Clustering High-Dimensional Data Using pbdR
Amit Amritkar
  University of Houston
    Vistas in Advanced Computing
Jefferson Amstutz
  Intel Corporation
    Flexible In Situ Visualization of LAMMPS Simulations
Jooneun An
  Korea Institute of Science and Technology Information
    Visualization of Decision-Making Support (DMS) Information for Responding to a Typhoon-Induced Disaster
James Ang
  Sandia National Laboratories
    Exascale Challenges and Opportunities
Rushil Anirudh
  Lawrence Livermore National Laboratory
    Performance Modeling under Resource Constraints Using Deep Transfer Learning
    P75: Model-Agnostic Influence Analysis for Performance Data
Katie Antypas
  National Energy Research Scientific Computing Center
    How Serious Are We About the Convergence Between HPC and Big Data?
    Parallel I/O in Practice
Hartwig Antz
  Karlsruhe Institute of Technology
  University of Tennessee
    Overcoming Load Imbalance for Irregular Sparse Matrices
Hartwig Anzt
  University of Tennessee
  Karlsruhe Institute of Technology
    Flexible Batched Sparse Matrix-Vector Product on GPUs
Yulong Ao
  Chinese Academy of Sciences
    Five-minute presentations by young researchers from around the world - part 1
Takayuki Aoki
  Tokyo Institute of Technology
    Hybrid Fortran: High Productivity GPU Porting Framework Applied to Japanese Weather Prediction Model
David Appelhans
  IBM
    Leveraging NVLINK and Asynchronous Data Transfer to Scale Beyond the Memory Capacity of GPUs
Charles Archer
  Intel Corporation
    Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1
Manuel Arenaz
  University of A Coruña and Appentra Solutions
    Parallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACC
Dorian Arnold
  Emory University
    Silent Errors in HPC Systems
    Experiencing HPC for Undergraduates: Careers in HPC
    Forming Strong Networks and Collaborations
    Connections II: Connecting with Mentors
    Forming Connections I: Connecting Sideways, with Ourselves and Our Peers
Yuuichi Asahi
  French Alternative Energies and Atomic Energy Commission
    Application of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore Platforms
Mitsuteru Asai
  Kyushu University
    P21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion Particles
Mark Asch
  University of Picardie
  Total SA
    Big Data and Exascale Computing (BDEC) Community Report
Samar Aseeri
  King Abdullah University of Science and Technology
    A Comparison of Distributed Memory Fast Fourier Transform (FFT) Library Packages
Rafael Asenjo
  University of Malaga
    Expressing Heterogeneous Parallelism in C++ with Intel Threading Building Blocks
Thomas Ashby
  IMEC
    P62: How To Do Machine Learning on Big Clusters
Joshua Asplund
  Lawrence Livermore National Laboratory
    DataRaceBench: A Benchmark Suite for Systematic Evaluation of Data Race Detection Tools
Danny Auble
  SchedMD LLC
    Slurm User Group Meeting
Guillaume Aupy
  French Institute for Research in Computer Science and Automation (INRIA)
    Periodic I/O Scheduling for Supercomputers
Brian Austin
  Lawrence Berkeley National Laboratory
    Performance and Energy Usage of Workloads on KNL and Haswell Architectures
    Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies
Jeff Autor
  Hewlett Packard
    PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and Control
Ammar Ahmad Awan
  Ohio State University
    An In-Depth Performance Characterization of CPU- and GPU-Based DNN Training on Modern Architectures
    A26: Co-Designing MPI Runtimes and Deep Learning Frameworks for Scalable Distributed Training on GPU Clusters
Abdulrahman Azab
  University of Oslo
  Partnership for Advanced Computing in Europe (PRACE)
    Containers in HPC

B

Abdel-Hameed Badawy
  New Mexico State University
    A Scalable Analytical Memory Model for CPU Performance Prediction
David Bader
  Georgia Institute of Technology
    15th Graph500 List
Michael Bader
  Technical University Munich
    Extreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust Earthquake
    Materials and Chemistry
Frank Baetke
  Hewlett Packard Enterprise
    BeeGFS - Architecture, Implementation Examples, and Future Development
    Lustre Community BoF: Lustre Deployments for the Next 5 Years
Saurabh Bagchi
  Purdue University
    Snowpack: Efficient Parameter Choice for GPU Kernels via Static Analysis and Statistical Prediction
Anna Maria Bailey
  Lawrence Livermore National Laboratory
    Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
Seonmyeong Bak
  University of Illinois
    Integrating OpenMP into the Charm++ Programming Model
Allison H. Baker
  National Center for Atmospheric Research
    Quality Assurance and Error Identification for the Community Earth System Model
Brandon Baker
  Intel Corporation
    P95: GEOPM: A Scalable Open Runtime Framework for Power Management
Jason Bakos
  University of South Carolina
    Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
    H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
Pavan Balaji
  Argonne National Laboratory
    Workshop on Exascale MPI (ExaMPI)
    Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1
    Advanced MPI Programming
    MPICH: A High-Performance Open-Source MPI Implementation
Prasanna Balaprakash
  Argonne National Laboratory
    Contemporary Design of Supercomputer Experiments
Rahul Bale
  RIKEN
    P24: A Deployment of HPC Algorithm into Pre/Post-Processing for Industrial CFD on K-Computer
Gabor Daniel Balogh
  Pazmany Peter Catholic University
    Comparison of Parallelization Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs
Daniel Balouek-Thomert
  Rutgers University
    Submarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced Cyberinfrastructure
Fabio Banchelli
  Barcelona Supercomputing Center
    P71: Is ARM Software Ecosystem Ready for HPC?
Kunal Banerjee
  Intel Corporation
    P31: Understanding the Performance of Small Convolution Operations for CNN on Intel Architecture
Purushotham Bangalore
  University of Alabama, Birmingham
    Workshop on Exascale MPI (ExaMPI)
    P45: Campaign Storage: Erasure Coding with GPUs
Neelofer Banglawala
  University of Edinburgh
    Women in HPC: Non-Traditional Paths to HPC and How They Can and Do Enrich the Field
Lorena Barba
  George Washington University
    HPC Software: Is “Cool Stuff” Really Incompatible with Sustainability?
Deborah Bard
  Lawrence Berkeley National Laboratory
    Galactos: Computing the 3-pt Anisotropic Correlation for 2 Billion Galaxies
    Getting Started with the Burst Buffer: Using DataWarp Technology
Ashley Barker
  Oak Ridge National Laboratory
    Small Business and the Exascale Computing Project
Kevin Barker
  Pacific Northwest National Laboratory
    Energy Efficient Supercomputing (E2SC)
    Verification of the Extended Roofline Model for Asynchronous Many Task Runtimes
    P99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big Data
Martina Barnas
  Indiana University
    Introduction - Workshop on Education for High Performance Computing (EduHPC)
    Panel: Attracting Women and Underrepresented Minorities to HPC and Data Science
Thomas Barr
  Research Institute at Nationwide Children's Hospital
    Computational Approaches for Cancer
Carlos Jaime Barrios Hernandez
  Advanced Computing Service for Latin America and the Caribbean
  Industrial University of Santander
    Americas HPC Collaboration
Andrea Bartolini
  ETH Zurich
    P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers
Chaitanya Baru
  National Science Foundation
    Common Big Data Challenges in Bio, Geo, Climate, and Social Sciences
Alexey Bataev
  IBM
    Implementing Implicit OpenMP Data Sharing on GPUs
Natalie Bates
  Energy Efficient HPC Working Group
    Eighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
    The Green500: Trends in Energy-Efficient Supercomputing
    Total Cost of Ownership and HPC System Procurement
    State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)
    P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers
Andrew Bauer
  Kitware Inc
    In Situ Summarization with VTK-m
    In Situ Analysis and Visualization with SENSEI
Michael Bauer
  Nvidia Corporation
    Control Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical Regions
John Baugh
  North Carolina State University
    Verifying Concurrency in an Adaptive Ocean Circulation Model
Mohammadreza Bayatpour
  Ohio State University
    Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design
Alexandre Bayen
  Lawrence Berkeley National Laboratory
  University of California, Berkeley
    Inference and Control in Routing Games
Neelima Bayyapu
  Argonne National Laboratory
    MPICH: A High-Performance Open-Source MPI Implementation
Daniel Beall
  Naval Research Laboratory
    P18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial Consortia
Scott Beamer
  Lawrence Berkeley National Laboratory
    Research Execution
Lee Beausoleil
  US Department of Defense
    Panel Discussion: Diversifying the HPC workforce
    Identifying the Roadblocks Facing Women in your Workforce
Fabian Beck
  University of Duisburg-Essen
    Introduction - 4th International Workshop on Visual Performance Analytics – VPA 2017
    Fourth International Workshop on Visual Performance Analysis – VPA 2017
Gregory Becker
  Lawrence Livermore National Laboratory
    Managing HPC Software Complexity with Spack
David Beckingsale
  Lawrence Livermore National Laboratory
    P76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for Kernels
Pete Beckman
  Director, Exascale Technology & Computing Institute
  Argonne National Laboratory
    Common Big Data Challenges in Bio, Geo, Climate, and Social Sciences
    HPC Connects Plenary: The Century of the City
    Cross-Layer Allocation and Management of Hardware Resources in Shared Memory Nodes
    Big Data and Exascale Computing (BDEC) Community Report
    The Internet of Things and HPC: Are They Teaming Up to Work Together?
Bradford M. Beckmann
  Advanced Micro Devices Inc
    Gravel: Fine-Grain GPU-Initiated Network Messages
Izaak Beekman
  ParaTools
    P04: Unstructured-Grid CFD Algorithms on Many-Core Architectures
Oceane Bel
  University of California, Santa Cruz
    CAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning
    P65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement Learning
Kellon Belfon
  Stony Brook University
    Experiencing HPC for Undergraduates: Graduate Student Perspective
Maxim Belkin
  University of Illinois
    Fourth SC Workshop on Best Practices for HPC Training
    Software Engineering and Reuse in Computational Science and Engineering
Gordon Bell
  Microsoft
    Thirty Years of the Gordon Bell Prize
Francis Belot
  Atomic Energy and Alternative Energies Commission
    State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)
Anouar Benali
  Argonne National Laboratory
    Embracing a New Era of Highly Efficient and Productive Quantum Monte Carlo Simulations
Siegfried Benkner
  University of Vienna
    Extending the Open Community Runtime with External Application Support
John Bent
  Seagate Government Solutions
    The Virtual Institute of I/O and the IO-500
Michael Bentley
  University of Utah
    A15: Quantifying Compiler Effects on Code Performance and Reproducibility Using FLiT
Brad Benton
  Advanced Micro Devices Inc
    GPU Triggered Networking for Intra-Kernel Communications
Pavel Benáček
  CESNET
    Case Study: Usage of High Level Synthesis in HPC Networking
Gheorghe-Teodor Bercea
  IBM
    Implementing Implicit OpenMP Data Sharing on GPUs
Ben Bergen
  Los Alamos National Laboratory
    P63: FleCSPH: a Parallel and Distributed Smoothed Particle Hydrodynamics Framework Based on FleCSI
Karen Bergman
  Columbia University
    Post Moore Supercomputing
Francine Berman
  Rensselaer Polytechnic Institute
    Blurring the Lines: High-End Computing and Data Science
David Bernholdt
  Oak Ridge National Laboratory
    OpenMP 4.5 Validation and Verification Suite
    Better Scientific Software
    Software Engineering and Reuse in Computational Science and Engineering
Carlo Bertolli
  IBM
    Implementing Implicit OpenMP Data Sharing on GPUs
Colleen Bertoni
  Argonne National Laboratory
    P32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDs
Martin Berzins
  University of Utah
  Scientific Computing and Imaging Institute
    Addressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUs
Maciej Besta
  ETH Zurich
    Scaling Betweenness Centrality Using Communication-Efficient Sparse Matrix Multiplication
E. Wes Bethel
  Lawrence Berkeley National Laboratory
    Introduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
    In Situ Analysis and Visualization with SENSEI
    ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
Blair Bethwaite
  Monash University
    OpenStack For HPC: Best Practices for Optimizing Software-Defined Infrastructure
Eugen Betke
  German Climate Computing Center
    P57: Adaptive Tier Selection for NetCDF and HDF5
    P15: Toward Decoupling the Selection of Compression Algorithms from Quality Constraints
Joshua Bevan
  University of Illinois
    P20: Facilitating the Scalability of ParSplice for Exascale Testbeds
Sridutt Bhalachandra
  University of North Carolina
    Using Runtime Energy Optimizations to Improve Energy Efficiency in High Performance Computing
Siddharth Bhat
  International Institute of Information Technology, Hyderabad
    Optimizing Geometric Multigrid Method Computation Using a DSL Approach
Abhinav Bhatele
  Lawrence Livermore National Laboratory
    Introduction - 4th International Workshop on Visual Performance Analytics – VPA 2017
    ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data
    Performance Modeling under Resource Constraints Using Deep Transfer Learning
    Predicting the Performance Impact of Different Fat-Tree Configurations
    Fourth International Workshop on Visual Performance Analysis – VPA 2017
Wahid Bhimji
  Lawrence Berkeley National Laboratory
    Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data
Paolo Bientinesi
  RWTH Aachen University
    A01: GEMM-Like Tensor-Tensor Contraction (GETT)
Amanda J. Bienz
  University of Illinois
    Reducing Communication Costs in the Parallel Algebraic Multigrid
Jay Jay Billings
  Oak Ridge National Laboratory
    Software Engineers: Careers in Research
Robert Bird
  Los Alamos National Laboratory
    A Scalable Analytical Memory Model for CPU Performance Prediction
George Biros
  University of Texas
    Geometry-Oblivious FMM for Compressing Dense SPD Matrices
    A Framework for Scalable Biophysics-Based Image Analysis
Sean Blanchard
  Los Alamos National Laboratory
    Experimental and Analytical Study of Xeon Phi Reliability
Wesley Bland
  Intel Corporation
    Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1
Gil Bloch
  Mellanox Technologies
    Accelerating Big Data Processing and Machine/Deep Learning Middleware on Modern HPC Clusters
Michael Blocksome
  Intel Corporation
    Why Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1
Michaela Blott
  Xilinx Inc
    Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
    H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
David Bock
  National Center for Supercomputing Applications, University of Illinois
    Simulation and Visual Representation of Tropical Cyclone-Ocean Interactions
Christian Bodenstein
  Research Center Juelich
    Supporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework
François Bodin
  University of Rennes
    European Exascale Projects and Their Global Contributions
David Boehme
  Lawrence Livermore National Laboratory
    Predicting the Performance Impact of Different Fat-Tree Configurations
Stanislav Bohm
  Technical University of Ostrava
    P62: How To Do Machine Learning on Big Clusters
Taisuke Boku
  University of Tsukuba
    Runtime Correctness Checking for Emerging Programming Paradigms
Barry Bolding
  Cray Inc
    How Serious Are We About the Convergence Between HPC and Big Data?
Evan F. Bollig
  University of Minnesota
    P60: Managing dbGaP Data with Stratus, a Research Cloud for Protected Data
Rosie Bolton
  Square Kilometre Array
    Life, the Universe and Computing: The Story of the SKA Telescope
Uday Bondhugula
  Indian Institute of Science
    Optimizing Geometric Multigrid Method Computation Using a DSL Approach
Matthias Book
  University of Iceland
    Supporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework
Utpal Bora
  International Institute of Information Technology, Hyderabad
    Improved Loop Distribution in LLVM Using Polyhedral Dependences
Ralph C. Bording
  Pawsey Supercomputing Centre
    4th International Workshop on HPC User Support Tools (HUST-17)
    HPC Carpentry - Practical, Hands-On HPC Training
Andrea Borghesi
  University of Bologna
    State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)
    P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing Centers
Kalina M. Borkiewicz
  National Center for Supercomputing Applications, University of Illinois
    Milky Way Analogue Isolated Disk Galaxy
    First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe
George Bosilca
  University of Tennessee
    Dynamic Task Discovery in PaRSEC- A Data-Flow Task-Based Runtime
    Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice
    Resilient Programming Environments
    Open MPI State of the Union XI
Charles A. Bouman
  Purdue University
    Massively Parallel 3D Image Reconstruction
Aurélien Bouteiller
  University of Tennessee
    Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice
Anne Dara Bowen
  Texas Advanced Computing Center, University of Texas
    Physical Signatures of Cancer Metastasis
Geoffrey C. Bower
  Academica Sinica Institute of Astronomy and Astrophysics
    realfast@VLA
Eric Boyer
  GENCI
    Total Cost of Ownership and HPC System Procurement
Andrew M. Bradley
  Sandia National Laboratories
    Designing Vector-Friendly Compact BLAS and LAPACK Kernels
Jim Brandt
  Sandia National Laboratories
    HPC Systems Monitoring Data in Action
Steven R. Brandt
  Louisiana State University
    Interactive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel Code
    HPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave API
David Brayford
  Leibniz Supercomputing Centre
    OpenHPC Community BoF
Michael J. Brazell
  University of Wyoming
    P28: High-Fidelity Blade-Resolved Wind Plant Modeling
Marisa Brazil
  Purdue University
    Building a Community: Outreach Strategies for Coordinating a Local WHPC Program
    Panel Discussion: Diversifying the HPC workforce
Peer-Timo Bremer
  Lawrence Livermore National Laboratory
    ScrubJay: Deriving Knowledge from the Disarray of HPC Performance Data
Ronny Brendel
  Oak Ridge National Laboratory
    An LLVM Instrumentation Plug-In for Score-P
Mauricio Breternitz
  University Institute of Lisbon
    GPU Triggered Networking for Intra-Kernel Communications
Alys Brett
  Culham Centre for Fusion Energy
    Software Engineering and Reuse in Computational Science and Engineering
    Software Engineers: Careers in Research
Sven Breuner
  ThinkParQ GmbH
    BeeGFS - Architecture, Implementation Examples, and Future Development
John Brevik
  California State University, Long Beach
    Probabilistic Guarantees of Execution Duration for Amazon Spot Instances
Patrick Bridges
  University of New Mexico
    Workshop on Exascale MPI (ExaMPI)
Ian Briggs
  University of Utah
    P84: PRESAGE: Selective Low Overhead Error Amplification for Easy Detection
Ron Brightwell
  Sandia National Laboratories
    Workshop on Exascale MPI (ExaMPI)
    Opening Remarks: MCHPC'17: Workshop on Memory Centric Programming for HPC
    sPIN: High-Performance Streaming Processing in the Network
    MCHPC2017: Workshop on Memory Centric Programming for HPC
André Brinkmann
  Johannes Gutenberg University Mainz
    A Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File System
Jed Brown
  University of Colorado, Boulder
    Contemporary Design of Supercomputer Experiments
Maxine Brown
  University of Illinois, Chicago
    SAGE2 9th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration
Nick Brown
  University of Edinburgh
    From Outreach to Education to Researcher: Innovative Ways of Expanding the HPC Community
    Panel Discussion: Diversifying the HPC workforce
    From Outreach to Education to Researcher - Innovative Ways of Expanding the HPC Community
    P81: Offloading Python Kernels to Micro-Core Architectures
Dana Brunson
  Oklahoma State University
    Fourth SC Workshop on Best Practices for HPC Training
Kris Bubendorfer
  Victoria University of Wellington
    Heuristic Dynamic Workflow Scheduling
Ronak Buch
  University of Illinois
    Migratable Objects and Task-Based Parallel Programming with Charm++
Robert Budden
  Pittsburgh Supercomputing Center
    OpenStack For HPC: Best Practices for Optimizing Software-Defined Infrastructure
Reuben Budiardja
  Oak Ridge National Laboratory
    Regression Testing and Monitoring Tools
Zoran Budimlic
  Rice University
    Graph500 on OpenSHMEM: Using a Practical Survey of Past Work to Motivate Novel Algorithmic Developments
Gina Bullock
  North Carolina Agricultural and Technical State University
    Teaching, Learning and Collaborating through Cloud Computing Online Classes
Aydin Buluc
  Lawrence Berkeley National Laboratory
  University of California, Berkeley
    Scaling Deep Learning on GPU and Knights Landing Clusters
    HPC Graph Toolkits and the GraphBLAS Forum
    Communication Efficient Methods
David Bunde
  Knox College
    "Peachy Assignments:" A New Edu* Conference Component
Hans-Joachim Bungartz
  Technical University Munich
    A Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma Simulations
    Citius, Altius, Fortius!
Sarah Burke-Spolaor
  West Virginia University
    realfast@VLA
Anastasiia Butko
  Lawrence Berkeley National Laboratory
    Workshop for Open Source Supercomputing
Bryan J. Butler
  National Radio Astronomy Observatory
    realfast@VLA
Ali R. Butt
  Virginia Tech
    TagIt: An Integrated Indexing and Search Service for File Systems
Suren Byna
  Lawrence Berkeley National Laboratory
    In-System Processing for Performance
Vetria Byrd
  Clemson University
    Scientific Visualization & Data Analytics Showcase Posters
    Scientific Visualization and Data Analytics Showcase Posters
    Scientific Visualization & Data Analytics Showcase Posters
    Scientific Visualization & Data Analytics Showcase Reception

C

Katharine Cahill
  Ohio Supercomputer Center
    A Proposed Model for Teaching Advanced Parallel Computing and Related Topics
Blake Caldwell
  University of Colorado, Boulder
    P59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing Environments
Rebecca Caldwell
  Winston-Salem State University
    Teaching, Learning and Collaborating through Cloud Computing Online Classes
Patrice Calegari
  Bull
    From HPC-as-a-Service to Deep Learning-as-a-Service
Gruia Calinescu
  Illinois Institute of Technology
    P12: Multi-Size Optional Offline Caching Algorithms
Martin Callaghan
  University of Leeds
    HPC Carpentry - Practical, Hands-On HPC Training
Scott Callaghan
  University of Southern California
    Panel Discussion: Diversifying the HPC workforce
    The Benefits of Mentoring: Why and How to Set Up a Program
    rvGAHP – Push-Based Job Submission Using Reverse SSH Connections
    From Outreach to Education to Researcher - Innovative Ways of Expanding the HPC Community
Spencer Callicott
  Mississippi State University
    A14: Analysis of Synthetic Graph Generation Methods for Directed Network Graphs
Kirk Cameron
  Virginia Tech
    Energy Efficient Supercomputing (E2SC)
    Funding Agencies
    HPC Impact Showcase: Computational Modeling
Andrew Canning
  Lawrence Berkeley National Laboratory
    P13: Large-Scale GW Calculations on Pre-Exascale HPC Systems
Shane Canon
  Lawrence Berkeley National Laboratory
    Container Computing for HPC and Scientific Workflows
    Containers in HPC
Christopher M. Cantalupo
  Intel Corporation
    P95: GEOPM: A Scalable Open Runtime Framework for Power Management
Franck Cappello
  Argonne National Laboratory
    Introduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
    Compression of Scientific Data
    Reconfigurable Computing in Exascale
    P37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum Chemistry
    Emerging Technologies Showcase (Day 3)
    H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
    Emerging Technologies Showcase (Day 1)
    Emerging Technologies Showcase (Day 2)
Danilo Carastan-Santos
  Federal University of ABC, Santo André, Brazil
  University of Grenoble
    Obtaining Dynamic Scheduling Policies with Simulation and Machine Learning
    Experiencing HPC for Undergraduates: Graduate Student Perspective
Lawrence Carin
  Duke University
    Introduction - Machine Learning in HPC Environments
Richard Carlson
  US Department of Energy
    Small Business and the Exascale Computing Project
William Carlson
  Institute for Defense Analyses
    Keynote: Shared Memory HPC Programming: Past, Present and Future
    PGAS Applications Workshop Panel
Marcelo Amaral
  Barcelona Supercomputing Center
    Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments
Philip Carns
  Argonne National Laboratory
    Analyzing Parallel I/O
Jeffrey D. Carpenter
  National Center for Supercomputing Applications, University of Illinois
    Milky Way Analogue Isolated Disk Galaxy
    First Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the Universe
David Carrera
  Barcelona Supercomputing Center
    Topology-Aware GPU Scheduling for Learning Workloads in Cloud Environments
Jeffrey C. Carver
  University of Alabama
    Introduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)
    Software Engineering and Reuse in Computational Science and Engineering
    Software Engineers: Careers in Research
    The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)
Dan Cassidy
  Los Alamos National Laboratory
    P47: Understanding Congestion on Omni-Path Fabrics
Ralph Castain
  Intel Corporation
    Charting the PMIx Roadmap
Vito Giovanni Castellana
  Pacific Northwest National Laboratory
    Introduction - IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms
    IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and Algorithms
Charlie Catlett
  Director, Urban Center for Computation & Data
  Argonne National Laboratory
    HPC Connects Plenary: The Century of the City
John Cavazos
  University of Delaware
    P76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for Kernels
Carlo Cavazzoni
  CINECA
    State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)
Aurelien Cavelan
  University of Basel
    Resilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance Analysis
Cris Cecka
  Nvidia Corporation
    Low Communication FMM-Accelerated FFT on GPUs
    Batched, Reproducible, and Reduced Precision BLAS
Milind Chabbi
  Independent
    Path-Synchronous Performance Monitoring in HPC Interconnection Networks with Source-Code Attribution
Sourav Chakraborty
  Ohio State University
    Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design
Bradford L. Chamberlain
  Cray Inc
    Introduction - PAW 2017: The 2nd Annual PGAS Applications Workshop
    PGAS Applications Workshop Panel
Henry Chan
  Argonne National Laboratory
    Visualizing Silicene Growth Through Island Migration and Coalescence
Sunita Chandrasekaran
  University of Delaware
    Introduction - Fourth Workshop on Accelerator Programming Using Directives (WACCPD)
    Introduction - Women in HPC: Diversifying the HPC Community
    OpenMP 4.5 Validation and Verification Suite
    An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT
    The OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained Impact
    Experiencing HPC for Undergraduates: Careers in HPC
    OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap
    Fourth Workshop on Accelerator Programming Using Directives (WACCPD)
Choongseok Chang
  Princeton University
    Facing the Big Data Challenge in the Fusion Code XGC
Kenneth Chang
  University of California, Santa Cruz
    CAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement Learning
    P65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement Learning
Barbara Chapman
  Stony Brook University
    Benchmarking and Evaluating Unified Memory for OpenMP GPU Offloading
    OpenMP Common Core: A “Hands-On” Exploration
    OpenSHMEM in the Era of Exascale
Dylan Chapp
  University of Delaware
    A17: Toward Capturing Nondeterminism Motifs in HPC Applications
Kyle Chard
  University of Chicago
    Probabilistic Guarantees of Execution Duration for Amazon Spot Instances
Ryan Chard
  Argonne National Laboratory
    Probabilistic Guarantees of Execution Duration for Amazon Spot Instances
Niladrish Chatterjee
  Nvidia Corporation
    Toward Standardized Near-Data Processing with Unrestricted Data Placement for GPUs
Bhaskar Chaudhury
  Dhirubhai Ambani Institute of Information and Communication Technology
    P27: Parallelization of the Particle-In-Cell Monte Carlo Collision (PIC-MCC) Algorithm for Plasma Simulation on Intel MIC Xeon Phi Architecture
Abhishek Chaurasia
  FWDNXT Inc
    Snowflake: Efficient Accelerator for Deep Neural Networks
Shuai Che
  Advanced Micro Devices Inc
    Gravel: Fine-Grain GPU-Initiated Network Messages
Bingwei Chen
  Tsinghua University
    15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz Scenarios
Changsheng Chen
  University of Massachusetts, Dartmouth
  Sun Yat-Sen University
    Visualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model System
Cheng Chen
  Data Storage Institute
  National University of Singapore
    Transactional NVM Cache with High Performance and Crash Consistency
Feng Chen
  University of Texas
    Advanced Manycore Programming (KNL)
Hsing-bung Chen
  Los Alamos National Laboratory
    P55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage Reliability
Jieyang Chen
  University of California, Riverside
    Correcting Soft Errors Online in Fast Fourier Transform
Tong Chen
  IBM
    Implementing Implicit OpenMP Data Sharing on GPUs
Xiaofei Chen
  Southern University of Science and Technology, China
    15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz Scenarios
Xinyu Chen
  University of New Mexico
    P53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaView
Yen Chen Chen
  National Taiwan University
    A03: A High-Speed Algorithm for Genome-Wide Association Studies on Multi-GPU Systems
Zizhong Chen
  University of California, Riverside
    Correcting Soft Errors Online in Fast Fourier Transform
    ParaStack: Efficient Hang Detection for MPI Programs at Large Scale
Sai P. Chenna
  University of Florida
    A FPGA-Pipelined Approach for Accelerated Discrete-Event Simulation of HPC Systems
Gopinath Chennupati
  Los Alamos National Laboratory
    A Scalable Analytical Memory Model for CPU Performance Prediction
Mathew J.
CherukaraArgonne National LaboratoryVisualizing Silicene Growth Through Island Migration and CoalescenceNaveen CherukuriIntel CorporationRun-to-Run Variability on Xeon Phi Based Cray XC SystemsKazem CheshmiRutgers UniversitySympiler: Transforming Sparse Matrix Codes by Decoupling Symbolic AnalysisWeng Cho ChewUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsAndrew ChienUniversity of ChicagoResilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance AnalysisBruce ChildersUniversity of PittsburghReproducibility and Uncertainty in High Performance ComputingWendy K. ChoNational Center for Supercomputing Applications, University of IllinoisP33: Massively Parallel Evolutionary Computation for Empowering Electoral Reform: Quantifying Gerrymandering via Multi-objective Optimization and Statistical AnalysisJaemin ChoiUniversity of Illinois at Urbana-ChampaignMigratable Objects and Task-Based Parallel Programming with Charm++A21: Runtime Support for Concurrent Execution of Overdecomposed Heterogeneous TasksAndrew Y. 
CholiyRutgers UniversityP12: Multi-Size Optional Offline Caching AlgorithmsFred ChongUniversity of ChicagoQuantum Computing and Irregular ApplicationsJerry ChouNational Tsing Hua University, TaiwanOptimizing the Query Performance of Block Index Through Data Analysis and I/O ModelingEdmond ChowGeorgia Institute of TechnologyDistributed Southwell: An Iterative Method with Low Communication CostsInvited Talks 3Invited Talks 4AJ ChristensenNational Center for Supercomputing Applications, University of IllinoisMilky Way Analogue Isolated Disk GalaxyFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseChing-Hsiang ChuOhio State UniversityA27: High-Performance and Scalable Broadcast Schemes for Deep Learning on GPU ClustersPi-Yueh ChuangGeorge Washington UniversityAn Example of Porting PETSc Applications to Heterogeneous Platforms with OpenACCNeil Chue HongUniversity of EdinburghIntroduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)Software Engineering and Reuse in Computational Science and EngineeringSoftware Engineers: Careers in ResearchThe 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)Ryan ChuiNational Center for Supercomputing Applications, University of IllinoisP38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data ManagementSudheer ChunduriArgonne National LaboratoryRun-to-Run Variability on Xeon Phi Based Cray XC SystemsIHsin ChungIBMTowards a Composable Computer SystemVladimir ChupakhinJanssen Global Services LLCP62: How To Do Machine Learning on Big ClustersMichael ChuvelevIntel CorporationWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1Vojtech CimaTechnical University of OstravaP62: How To Do Machine Learning on Big ClustersFlorina M. CiorbaUniversity of BaselP74: A Methodology for Bridging the Native and Simulated Executions of Parallel ApplicationsSelim CiraciMicrosoftIntroduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)Raymond C. Clay IIISandia National LaboratoriesEmbracing a New Era of Highly Efficient and Productive Quantum Monte Carlo SimulationsDavid CliftonANSYS IncHPC Systems Professionals WorkshopThomas CluneNASA Goddard Space Flight CenterpFlogger: The Parallel Fortran Logging Framework for HPC ApplicationsRichard CoffeyArgonne National LaboratoryFourth SC Workshop on Best Practices for HPC TrainingHPC Education: Meeting of the SIGHPC Education ChapterPaul CoffmanArgonne National LaboratoryWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1Susan CoghlanArgonne National LaboratoryEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Albert CohenFrench Institute for Research in Computer Science and Automation (INRIA)Implementation of a Cache Miss Calculator in LLVM/PollyGary CollinsUniversity of TennesseeFlexible Batched Sparse Matrix-Vector Product on GPUsToni CollisAppentra Solutions, Women in High Performance ComputingEmbracing Diversity: the BenefitsPanel Discussion: Diversifying the HPC workforceIntroduction - Women in HPC: Diversifying the HPC CommunityCareer Panel Discussion: Hints and Tips to Progress Your CareerWorkshop Outcomes and ClosingSpeed NetworkingWomen in HPC: Non-Traditional Paths to HPC and How They Can and Do Enrich the FieldRecruitment: How to Build Diverse TeamsWomen in HPC: Diversifying the HPC WorkforceGuojing CongIBMAccelerating Deep Neural Network Learning for Speech Recognition on a Cluster of GPUsPaul ConstantineUniversity of Colorado, BoulderContemporary Design of Supercomputer ExperimentsMike ConwayRenaissance Computing InstituteVirtualization Ecosystems – Supporting Increasingly Complex Scientific ApplicationsSteve ConwayHyperion ResearchA Taxonomy of HPDA AlgorithmsBlurring the Lines: High-End Computing and Data ScienceJeanine CookSandia National LaboratoriesTime ManagementJonathan CookNew Mexico State UniversityP73: HPC Production Job Quality AssessmentJames CoomerDataDirect NetworksBest Practices for Architecting Performance and Capacity in the Burst Buffer EraBurst Buffers: Flash in the Pan?Marcin CopikRWTH Aachen UniversityA05: Parallel Prefix Algorithms for the Registration of Arbitrarily Long Electron Micrograph SeriesThomas CorcoranLawrence Berkeley National LaboratoryP36: A Novel Feature-Preserving Spatial Mapping for Deep Learning Classification of Ras StructuresAnthony CostaIcahn School of Medicine at Mount SinaiMedical Image Analysis and VisualizationTimothy B. 
CostaIntel CorporationDesigning Vector-Friendly Compact BLAS and LAPACK KernelsBatched, Reproducible, and Reduced Precision BLASJim CownieIntel CorporationLLVM in HPC: Uses and DesiresOpenMP® is Twenty. Where Is It Going?David CoxHarvard UniversityInput-Aware Auto-Tuning of Compute-Bound HPC KernelsDonna J. CoxNational Center for Supercomputing Applications, University of IllinoisMilky Way Analogue Isolated Disk GalaxyFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseSilvia CrivelliLawrence Berkeley National LaboratoryP36: A Novel Feature-Preserving Spatial Mapping for Deep Learning Classification of Ras StructuresPeter D. CrossmanLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesCarlos A. CruzNASA Goddard Space Flight CenterpFlogger: The Parallel Fortran Logging Framework for HPC ApplicationsXuewen CuiVirginia TechP82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining vs. NVIDIA’s Unified MemoryMassimiliano CulpoSwiss Federal Institute of Technology in LausanneManaging HPC Software Complexity with SpackEugenio CulurcielloFWDNXT IncSnowflake: Efficient Accelerator for Deep Neural NetworksMatthew L. CurrySandia National LaboratoriesP45: Campaign Storage: Erasure Coding with GPUsTony CurtisStony Brook UniversityOpenSHMEM in the Era of Exascale
D
John D'AmbrosiaEthernet AllianceHuaweiThe Ethernet Portfolio for HPCNicholas D'ImperioBrookhaven National LaboratoryP34: GPU Acceleration for the Impurity Solver in GW+DMFT PackagesMichael D'melloIntel CorporationP30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon PhiFelipe H. 
da JornadaUniversity of California, BerkeleyP13: Large-Scale GW Calculations on Pre-Exascale HPC SystemsTamara DahlgrenLawrence Livermore National LaboratoryP94: Fully Hierarchical Scheduling: Paving the Way to Exascale WorkloadsChristopher S. DaleyLawrence Berkeley National LaboratoryPerformance and Energy Usage of Workloads on KNL and Haswell ArchitecturesPatricia DamkrogerIntel CorporationIntroduction - Women in HPC: Diversifying the HPC CommunityEarly Career CoachingAnthony DanalisUniversity of TennesseeP72: New Developments for PAPI 5.6+Tharun Kumar DangetiInternational Institute of Information Technology, HyderabadImproved Loop Distribution in LLVM Using Polyhedral DependencesAnwesha DasNorth Carolina State UniversityP89: Desh: Deep Learning for HPC System Health ResilienceArnab DasUniversity of UtahP84: PRESAGE: Selective Low Overhead Error Amplification for Easy DetectionSantanu DasInternational Institute of Information Technology, HyderabadImproved Loop Distribution in LLVM Using Polyhedral DependencesChristos DavatzikosUniversity of PennsylvaniaA Framework for Scalable Biophysics-Based Image AnalysisJames DavisUniversity of WarwickAn Efficient Task-Based All-Reduce for Machine Learning ApplicationsMiyuru DayarathnaWSO2 IncMultiple Stream Job Performance Optimization with Source Operator Graph TransformationsAndreas de BlancheUniversity West SwedenTetra PakP44: Increasing Throughput of Multiprogram HPC Workloads: Evaluating a SMT Co-Scheduling ApproachRaphael Y. 
de CamargoFederal University of ABC, Santo André, BrazilObtaining Dynamic Scheduling Policies with Simulation and Machine LearningCees de LaatUniversity of AmsterdamInnovating the Network for Data Intensive Science (INDIS)Gustavo De LeonLos Alamos National LaboratoryUniversity of California, BerkeleyP54: Investigating Hardware Offloading for Reed-Solomon EncodingDaniel OliveiraFluminense Federal UniversityToward Preserving Results Confidentiality in Cloud-Based Scientific WorkflowsDaniele De SensiUniversity of PisaNornir: A Power-Aware Runtime Support for Parallel ApplicationsBronis R. de SupinskiLawrence Livermore National LaboratoryAdvanced OpenMP: Performance and 4.5 FeaturesMastering Tasking with OpenMPP82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining vs. NVIDIA’s Unified MemoryCutting Edge File Systems Tom DeakinUniversity of BristolP69: Portable Methods for Measuring Cache Hierarchy PerformanceDiptorup DebUniversity of North CarolinaQUARC: An Optimized DSL Framework Using LLVMNathan DebardelebenLos Alamos National LaboratoryExperimental and Analytical Study of Xeon Phi ReliabilityP92: Characterization and Comparison of Application Resilience for Serial and Parallel ExecutionsEwa DeelmanInformation Sciences Institute, University of Southern CaliforniarvGAHP – Push-Based Job Submission Using Reverse SSH ConnectionsMauro Del BenLawrence Berkeley National LaboratoryP13: Large-Scale GW Calculations on Pre-Exascale HPC SystemsRobert DeLeonUniversity at BuffaloTracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRTRobert L. DeleonUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisPhil DemarFermi National LaboratoryP43: Deep Packet/Flow Analysis Using GPUsDavid E. 
DeMarleKitware IncLarge Scale Visualization with ParaViewJames DemmelUniversity of California, BerkeleyScaling Deep Learning on GPU and Knights Landing ClustersLinear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and AcceleratorsPaul DemorestNational Radio Astronomy Observatoryrealfast@VLANicolas DenoyelleFrench Institute for Research in Computer Science and Automation (INRIA)Modeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline ModelJohn W. DermerLos Alamos National LaboratoryP54: Investigating Hardware Offloading for Reed-Solomon EncodingJack DeslippeLawrence Berkeley National LaboratoryDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesHariharan DevarajanIllinois Institute of TechnologyEvaluating GPGPU Memory Performance Through the C-AMAT ModelOpen Ethernet Drive: Evolution of Energy-Efficient Storage TechnologyMehmet DeveciSandia National LaboratoriesDesigning Vector-Friendly Compact BLAS and LAPACK KernelsMike DewarNumerical Algorithms GroupHPC Software: Is “Cool Stuff” Really Incompatible with Sustainability?Salvatore Di GirolamoETH ZurichsPIN: High-Performance Streaming Processing in the NetworkA08: Virtualized Big Data: Reproducing Simulation Output on DemandSheng DiArgonne National LaboratoryAn Efficient Approach to Lossy Compression with Pointwise Relative Error BoundP37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum ChemistryLori DiachinLawrence Livermore National LaboratoryUsing HPC to Impact US Manufacturing through the HPC4Mfg ProgramGerrett DiamondRensselaer Polytechnic InstituteDynamic Load Balancing of Massively Parallel Unstructured MeshesPhilip DiamondSquare Kilometre ArrayLife, the Universe and Computing: The Story of the SKA TelescopeMattias DienerUniversity of IllinoisVisualizing, Measuring, and Tuning Adaptive MPI 
ParametersIntegrating OpenMP into the Charm++ Programming ModelMark DietrichCompute CanadaSupercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and OrganizationsAmericas HPC CollaborationGary A. DiltsLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesNan DingTsinghua UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLightMinh DinhUniversity of QueenslandFive-minute presentations by young researchers from around the world - part 1Sebastian DoebelTechnical University DresdenAn LLVM Instrumentation Plug-In for Score-PDouglas DoerflerLawrence Berkeley National LaboratoryPerformance and Energy Usage of Workloads on KNL and Haswell ArchitecturesUsability, Scalability and Productivity on Many-Core Processors: Intel Xeon PhiJiri DokulilUniversity of ViennaExtending the Open Community Runtime with External Application SupportDavid DomyancicLawrence Livermore National LaboratoryP94: Fully Hierarchical Scheduling: Paving the Way to Exascale WorkloadsBin DongLawrence Berkeley National LaboratoryOptimizing the Query Performance of Block Index Through Data Analysis and I/O ModelingWenjie DongSun Yat-Sen UniversityVisualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model SystemJack DongarraUniversity of TennesseeIntroduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale SystemsInvestigating Half-Precision Arithmetic to Accelerate Dense Linear System SolversFlexible Batched Sparse Matrix-Vector Product on GPUsDynamic Task Discovery in PaRSEC- A Data-Flow Task-Based RuntimeKeynote - An Overview of High Performance Computing and Challenges for the FutureTOP500 - Past, Present, FutureLinear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and AcceleratorsBig Data and Exascale Computing (BDEC) Community ReportTOP500 
SupercomputersBatched, Reproducible, and Reduced Precision BLASP72: New Developments for PAPI 5.6+David DonofrioLawrence Berkeley National LaboratoryWorkshop for Open Source SupercomputingPARADISE: A ToolFlow to Model Emerging Technologies for the Post-CMOS Era in HPCReconfigurable Computing in ExascaleRion DooleyUniversity of TexasHPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave APIMatthieu DorierArgonne National LaboratorySupporting Task-level Fault-Tolerance in HPC Workflows by Launching MPI Jobs inside MPI JobsMatthieu DreherArgonne National LaboratoryIn Situ Workflows at Exascale: System Software to the RescueNikoli DrydenUniversity of IllinoisLawrence Livermore National LaboratoryToward Scalable Parallel Training of Deep Neural NetworksDavid H.C. DuUniversity of MinnesotaP56: ZoneTier: A Zone-Based Storage Tiering and Caching Co-Design to Integrate SSDs with Host-Aware SMR DrivesXiaohui DuanShandong UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLightNicolas DubeHewlett PackardPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlAnshu DubeyArgonne National LaboratoryUniversity of ChicagoProposal for a Scientific Software Lifecycle ModelBetter Scientific SoftwareMultiphysicsPradeep DubeyIntel CorporationDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesArtificial Intelligence and The Virtuous Cycle of ComputeNicolas DubéHewlett PackardEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Dmitry DuplyakinUniversity of UtahContemporary Design of Supercomputer ExperimentsEarl DuqueIntelligent LightIntroduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationHPC Powers Wind EnergyISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationThomas DurbinDurbin 
EngineeringEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Dmitry DurnovIntel CorporationWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Matthew DwyerUniversity of NebraskaTowards Self-Verification in Finite Difference Code GenerationMichael D’melloIntel CorporationAn Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi Processor
E
Jonathan EastepIntel CorporationPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlP95: GEOPM: A Scalable Open Runtime Framework for Power ManagementJoe EatonNvidia CorporationParallel Jaccard and Related Graph Clustering TechniquesJerry EbalunodeUniversity of HoustonVistas in Advanced ComputingH. Carter EdwardsSandia National LaboratoriesKokkos: Enabling Manycore Performance Portability for C++ Applications and Domain Specific Libraries/LanguagesStratos EfstathiadisNew York UniversitySecond Annual Meeting of the SIGHPC - Big Data ChapterAlexandre EichenbergerIBMImplementing Implicit OpenMP Data Sharing on GPUsStephan EidenbenzLos Alamos National LaboratoryA Scalable Analytical Memory Model for CPU Performance PredictionVictor EijkhoutUniversity of TexasAdvanced Manycore Programming (KNL)Greg EisenhauerGeorgia Institute of TechnologyParallel Streaming for In Transit Analysis with Heterogeneous Data LayoutDaniel EisensteinHarvard UniversityGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesMohamed El-HadedyUniversity of IllinoisRE-HASE: Regular-Expressions Hardware Synthesis EngineIzzat El-HajjUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsNosayba El-SayedMassachusetts Institute of TechnologyQatar Computing Research InstituteUnderstanding Object-Level Memory Access Patterns Across the SpectrumAhmed EleliemyUniversity of BaselP74: A Methodology for Bridging the Native and Simulated 
Executions of Parallel ApplicationsSally EllingsonUniversity of KentuckyDeep LearningMichael J. Ellsworth, Jr.IBMEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Nahid EmadUniversity of VersaillesMaison de la SimulationParallel Jaccard and Related Graph Clustering TechniquesRuntime Correctness Checking for Emerging Programming ParadigmsJoel EmerNvidia CorporationUnderstanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and ApplicationsToshio EndoTokyo Institute of TechnologyTSUBAME3.0: A Green, Accelerated, Big-Data SupercomputerApplying Temporal Blocking with a Directive-Based ApproachState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)P05: ooc_cuDNN : A Deep Learning Library Supporting CNNs over GPU Memory CapacityChristian EngelmannOak Ridge National LaboratoryIntroduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale SystemsFailures in Large Scale Systems: Long-Term Measurement, Analysis, and ImplicationsCharacterizing Faults, Errors, and Failures in Extreme-Scale SystemsNicolás ErdödyOpen Parallel LtdSKA: The Ultimate Big Data ProjectThe Internet of Things and HPC: Are They Teaming Up to Work Together?Mattan ErezUniversity of TexasSilent Errors in HPC SystemsRajeev S. 
ErramilliLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesOscar Esquivel-FloresMonterrey Institute of TechnologyInvited Talk - On Improved Monte Carlo Hybrid Methods for Preconditioner ComputationsTrilce EstradaUniversity of New MexicoPanel: Attracting Women and Underrepresented Minorities to HPC and Data ScienceRevisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsP53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaViewJean-Matthieu ETANCELINUniversity of Reims Champagne-ArdenneP64: romeoLAB : HPC Training Platform on HPC facility
F
Peyman FaizianFlorida State UniversityModeling UGAL on the Dragonfly TopologyA Comparative Study of SDN and Adaptive Routing on Dragonfly NetworksAlessandro FanfarilloNational Center for Atmospheric ResearchPerformance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray FortranAiman FangUniversity of ChicagoResilient N-Body Tree Computations with Algorithm-Based Focused Recovery: Model and Performance AnalysisJian FangDelft University of TechnologyAdopting OpenCAPI for High Bandwidth Database AcceleratorsMassimiliano FaticaNvidia CorporationA Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU SystemsFarzad Fatollahi-FardLawrence Berkeley National LaboratoryWorkshop for Open Source SupercomputingChristian FeldJuelich Supercomputing CenterHands-On Practical Hybrid Parallel Application Performance EngineeringAlexandre FenderNvidia CorporationUniversity of VersaillesParallel Jaccard and Related Graph Clustering TechniquesWu FengVirginia TechThe Green500: Trends in Energy-Efficient SupercomputingEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)P82: Performance Evaluation of the NVIDIA Tesla P100: Our Directive-Based Partitioning and Pipelining 
vs. NVIDIA’s Unified MemoryJohn FeoPacific Northwest National LaboratoryIntroduction - IA^3 2017 - 7th Workshop on Irregular Applications: Architectures and AlgorithmsIA^3 2017 - 7th Workshop on Irregular Applications: Architectures and AlgorithmsCharles R. FerenbaughLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesMark FernandezHewlett Packard EnterpriseHPC in Space: Supercomputing at 17,500 MPHRafael Ferreira da SilvaUniversity of Southern CaliforniaOn the Use of Burst Buffers for Accelerating Data-Intensive Scientific WorkflowsKurt B. FerreiraSandia National LaboratoriesP93: Spacehog: Evaluating the Costs of Dedicating Resources to In Situ AnalysisNicola FerrierArgonne National LaboratoryIntroduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationDaniel A. FeshbachHaverford CollegeA20: Correctness Verification and Boundary Conditions for Chapel Iterator-Based Loop OptimizationAdam FidelTexas A&M UniversityBounded Asynchrony and Nested Parallelism for Scalable Graph ProcessingSteve FieldsIBMOpenCAPI: High Performance, Host-Agnostic, Coherent Accelerator InterfaceWeronika FilingerUniversity of EdinburghFrom Outreach to Education to Researcher - Innovative Ways of Expanding the HPC CommunitySalvatore FilipponeCranfield UniversityIntroduction - PAW 2017: The 2nd Annual PGAS Applications WorkshopHal FinkelArgonne National LaboratoryFPGAs for Supercomputing? 
Progress and ChallengesIntroduction - LLVM-HPC2017: Fourth Workshop on the LLVM Compiler Infrastructure in HPCDeveloping an OpenMP Runtime for UVM-Capable GPUsBenchmarking and Evaluating Unified Memory for OpenMP GPU OffloadingOpenMP 4.5 Validation and Verification SuiteConcluding Remarks – LLVM-HPC2017Distributed and Heterogeneous Programming in C++ for HPCLLVM in HPC: Uses and DesiresLLVM-HPC2017: Fourth Workshop on the LLVM Compiler Infrastructure in HPCPaul FischerUniversity of IllinoisWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Goran FlegarJaume I UniversityOvercoming Load Imbalance for Irregular Sparse MatricesFlexible Batched Sparse Matrix-Vector Product on GPUsKermin FlemingIntel CorporationLESS: Loop Nest Execution Strategies for Spatial ArchitecturesFernanda FoertterOak Ridge National LaboratoryFourth SC Workshop on Best Practices for HPC TrainingCareer Panel Discussion: Hints and Tips to Progress Your CareerOvercoming the Confidence GapParallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACCThe OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactAn Example of Porting PETSc Applications to Heterogeneous Platforms with OpenACCKokkos: Enabling Manycore Performance Portability for C++ Applications and Domain Specific Libraries/LanguagesOpenACC API User Experience, Vendor Reaction, Relevance, and RoadmapHPC Education: Meeting of the SIGHPC Education ChapterInteractivity in SupercomputingMike FolkHDF GroupSoftware Engineering and Reuse in Computational Science and EngineeringJohn FonnerTexas Advanced Computing Center, University of TexasHPC via HTTP: Portable, Scalable Computing Using App Containers and the Agave APIJohn C. 
ForbesHarvard UniversityHarvard-Smithsonian Center for AstrophysicsMilky Way Analogue Isolated Disk GalaxyAndrea FormisanoUniversity of PerugiaAccelerating Energy Games Solvers on Modern ArchitecturesIan FosterArgonne National LaboratoryIntroduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)Cloud Computing for Science and EngineeringThe 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)Pouya FotouhiUniversity of California, DavisP50: Energy-Efficient and Scalable Bio-Inspired Nanophotonic ComputingP49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect ArchitecturesYvan FournierEDF FranceMelissa: Large Scale In Transit Global Sensitivity Analysis Avoiding Intermediate FilesRobert J. FowlerUniversity of North CarolinaQUARC: An Optimized DSL Framework Using LLVMWilliam FoxGeorgia Institute of TechnologyUniversity of California, San FranciscoE-HPC: A Library for Elastic Resource Management in HPC EnvironmentsFranz FranchettiCarnegie Mellon UniversityP06: Large Scale FFT-Based Stress-Strain Simulations with Irregular Domain DecompositionIA^3 DebateTommy FranczakNorthern Illinois UniversityA Path from Serial Execution to Hybrid Parallelization for Learning HPCRobert Freeman JrHarvard UniversityHPC Carpentry - Practical, Hands-On HPC TrainingBernhard FriebeIntel CorporationEnabling FPGAs for the Software DevelopersBrian FriesenNational Energy Research Scientific Computing CenterPerformance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray FortranGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesHaohan FuTsinghua UniversityNational Supercomputing Center, WuxiRedesigning CAM-SE for Peta-Flops Performance on Sunway TaihuLightLessons on Integrating and Utilizing 10 Million Cores: Experience of Sunway TaihuLightRedesigning CAM-SE for Petascale Climate Modeling Performance on 
Sunway TaihuLight15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz ScenariosSong FuUniversity of North TexasP55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage ReliabilityAkihiro FujiiKogakuin UniversityP14: Robust SA-AMG Solver by Extraction of Near-Kernel VectorsKatsuki FujisawaKyushu UniversityNational Institute of Advanced Industrial Science and TechnologyCyber-Physical System and Industrial Applications of Large-Scale Graph Analysis and Optimization ProblemsP78: Performance Evaluation of Graph500 Considering CPU-DRAM Power ShiftingHajime FujitaIntel CorporationWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Kohei FujitaUniversity of TokyoRIKENImplicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACCP09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element SimulationsP23: AI with Super-Computed Data for Monte Carlo Earthquake Hazard ClassificationDouglas FullerRed Hat IncCeph Applications in HPC EnvironmentsStudent/Postdoc Job FairThomas R. 
FurlaniUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisTracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRTMikito FuruichiJapan Agency for Marine-Earth Science and TechnologyP21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion ParticlesYasunori FutamuraUniversity of TsukubaEfficient and Scalable Calculation of Complex Band Structure Using Sakurai-Sugiura Method
G
Abhinav GabaIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationAlice-Agnes GabrielLudwig Maximilian University of MunichExtreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust EarthquakeNiall GaffneyUniversity of TexasVirtualization Ecosystems – Supporting Increasingly Complex Scientific ApplicationsAna GainaruVanderbilt UniversityPeriodic I/O Scheduling for SupercomputersKelly GaitherUniversity of TexasPanel Discussion: Diversifying the HPC workforceIntroduction - Women in HPC: Diversifying the HPC CommunityCareer Panel Discussion: Hints and Tips to Progress Your CareerHints and Tips for Public SpeakingHigh Performance Computing Education in US Data ScienceScientific Visualization & Data Analytics ShowcaseJames GalarowiczKrell InstituteHow To Analyze the Performance of Parallel Codes 101Steven M. 
GalloUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisTodd GamblinLawrence Livermore National Laboratory4th International Workshop on HPC User Support Tools (HUST-17)Projecting Performance Data Over Simulation Geometry Using SOSflow and AlpineScrubJay: Deriving Knowledge from the Disarray of HPC Performance DataPerformance Modeling under Resource Constraints Using Deep Transfer LearningPredicting the Performance Impact of Different Fat-Tree ConfigurationsManaging HPC Software Complexity with SpackP75: Model-Agnostic Influence Analysis for Performance DataLin GanTsinghua UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLightAryya GangopadhyayUniversity of Maryland, Baltimore CountyMultidisciplinary Education on Big Data + HPC + Atmospheric SciencesSangram GangulyNASA Ames Research CenterCommon Big Data Challenges in Bio, Geo, Climate, and Social SciencesDennis GannonIndiana UniversityCloud Computing for Science and EngineeringGuang GaoUniversity of DelawareVerification of the Extended Roofline Model for Asynchronous Many Task RuntimesP99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big DataTao GaoUniversity of DelawareA23: Evaluation of Data-Intensive Applications on Intel Knights Landing ClusterEric GarciaIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationRao V. GarimellaLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesMichael GarlandNvidia CorporationParallel Depth-First Search for Directed Acyclic GraphsNitin A. 
GawandePacific Northwest National LaboratoryEvaluating On-Node GPU Interconnects for Deep Learning WorkloadsMarkus GeimerJuelich Supercomputing CenterHands-On Practical Hybrid Parallel Application Performance EngineeringAl GeistOak Ridge National LaboratoryIntroduction - 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale SystemsBrad GeltzIntel CorporationP95: GEOPM: A Scalable Open Runtime Framework for Power ManagementAnn GentileSandia National LaboratoriesHPC Systems Monitoring Data in ActionRaffaella GentiliniUniversity of PerugiaAccelerating Energy Games Solvers on Modern ArchitecturesGiorgis GeorgakoudisQueen's University BelfastREFINE: Realistic Fault Injection via Compiler-Based Instrumentation for Accuracy, Portability and SpeedEvangelos GeorganasIntel CorporationP31: Understanding the Performance of Small Convolution Operations for CNN on Intel ArchitectureAlan GeorgeUniversity of PittsburghReconfigurable Supercomputing (RSC)Daniel GeorgeNational Center for Supercomputing Applications, University of IllinoisA13: Deep Learning with HPC Simulations for Extracting Hidden Signals: Detecting Gravitational WavesRichard GerberLawrence Berkeley National LaboratoryFourth SC Workshop on Best Practices for HPC TrainingLisa GerhardtLawrence Berkeley National LaboratoryContainer Computing for HPC and Scientific WorkflowsSandra GesingUniversity of Notre DameIntroduction - WORKS 2017 (12th Workshop on Workflows in Support of Large-Scale Science)Berk GeveciKitware IncIn Situ Summarization with VTK-mSheikh K. 
GhafoorTennessee Technological UniversityIntroduction - Workshop on Education for High Performance Computing (EduHPC)Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsAmir GholamiUniversity of TexasA Framework for Scalable Biophysics-Based Image AnalysisExperiencing HPC for Undergraduates: Graduate Student PerspectiveDevarshi GhoshalLawrence Berkeley National LaboratoryE-HPC: A Library for Elastic Resource Management in HPC EnvironmentsPaolo GiannozziUniversity of UdineA Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU SystemsPaul GibbonForschungszentrum JuelichP87: EoCoE Performance Benchmarking Methodology for Renewable Energy ApplicationsMike GilesUniversity of OxfordBeyond 16GB: Out-of-Core Stencil ComputationsP01: Cache-Blocking Tiling of Large Stencil Codes at RuntimeLauren GillespieSouthwestern UniversityP47: Understanding Congestion on Omni-Path FabricsLadina GillySwiss National Supercomputing CentreEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Alfredo GimenezLawrence Livermore National LaboratoryUniversity of California, DavisScrubJay: Deriving Knowledge from the Disarray of HPC Performance DataProjecting Performance Data Over Simulation Geometry Using SOSflow and AlpineJudit GimenezBarcelona Supercomputing CenterIntroduction - 4th International Workshop on Visual Performance Analytics – VPA 2017Fourth International Workshop on Visual Performance Analysis – VPA 2017Benjamin H. 
GlickLewis & Clark CollegeA07: Scalable Parallel Scripting in the CloudMatthias GobbertUniversity of Maryland, Baltimore CountyMultidisciplinary Education on Big Data + HPC + Atmospheric SciencesJens Henrik GoebbertForschungszentrum JuelichComprehensive Visualization of Large-Scale Simulation Data Linked to Respiratory Flow Computations on HPC SystemsBrice GoglinFrench Institute for Research in Computer Science and Automation (INRIA)Modeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline ModelCross-Layer Allocation and Management of Hardware Resources in Shared Memory NodesEng Lim GohHewlett Packard EnterpriseHPC in Space: Supercomputing at 17,500 MPHAli Murat GokArgonne National LaboratoryNorthwestern UniversityP37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum ChemistryNathan J. GoldbaumNational Center for Supercomputing Applications, University of IllinoisMilky Way Analogue Isolated Disk GalaxyDeb GoldfarbIntel CorporationNegotiation SkillsCareer Panel Discussion: Hints and Tips to Progress Your CareerSally GoldmanGooglePanel: Attracting Women and Underrepresented Minorities to HPC and Data ScienceAntonio Tedu A. 
GomesNational Laboratory for Scientific Computing, BrazilSupercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and OrganizationsConstantino GomezBarcelona Supercomputing CenterFive-minute presentations by young researchers from around the world - part 2Rosalia GomezTexas Advanced Computing Center, University of TexasHigh Performance Computing Education in US Data ScienceQian GongFermi National LaboratoryP43: Deep Packet/Flow Analysis Using GPUsYifan GongTuSimpleEfficient Process Mapping in Geo-Distributed Cloud Data CentersElsa GonsiorowskiLawrence Livermore National LaboratoryCareer Panel Discussion: Hints and Tips to Progress Your CareerHow to Take the Next Step in Your CareerGanesh GopalakrishnanUniversity of UtahP84: PRESAGE: Selective Low Overhead Error Amplification for Easy DetectionMark GordonIowa State UniversityAn Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi ProcessorP32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDsMark S. GordonIowa State UniversityPorting a GAMESS Computational Chemistry Kernel to FPGAsP30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon PhiSteven GordonOhio Supercomputer CenterA Proposed Model for Teaching Advanced Parallel Computing and Related TopicsHPC Education: Meeting of the SIGHPC Education ChapterSergei GorlatchUniversity of MunsterPACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming ModelGerard GormanImperial College, LondonTowards Self-Verification in Finite Difference Code GenerationSoftware Engineering and Reuse in Computational Science and EngineeringR. 
GovindarajanIndian Institute of ScienceHPC Initiatives in IndiaPaolo GraniUniversity of California, DavisP49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect ArchitecturesDavid GrantOak Ridge National LaboratoryEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Ryan GrantSandia National LaboratoriesWorkshop on Exascale MPI (ExaMPI)sPIN: High-Performance Streaming Processing in the NetworkPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlCatherine GravesHewlett PackardComputing with Physics: Analog Computation and Neural Network Classification with a Dot Product EngineJennifer GreenLos Alamos National LaboratoryHow To Analyze the Performance of Parallel Codes 101Kevin GriffinLawrence Livermore National LaboratoryScalable HPC Visualization and Data Analysis Using VisItLeopold GrinbergIBMP79: Porting the Opacity Client Library to a CPU-GPU Cluster Using OpenMP 4.5William GroppUniversity of IllinoisChallenges in Programming Extreme Scale SystemsEnergy Efficiency Gains From Software: Retrospectives and PerspectivesAdvanced MPI ProgrammingSoftware Engineering and Reuse in Computational Science and EngineeringP70: FFT, FMM, and Multigrid on the Road to Exascale: Performance Challenges and OpportunitiesTobias GrosserETH ZurichImproved Loop Distribution in LLVM Using Polyhedral DependencesMax GrossmanRice UniversityGraph500 on OpenSHMEM: Using a Practical Survey of Past Work to Motivate Novel Algorithmic DevelopmentsChapel-on-X: Exploring Tasking Runtimes for PGAS LanguagesRobert GrossmanUniversity of ChicagoBlurring the Lines: High-End Computing and Data ScienceSharing Research Data: Data Commons, Distributed Clouds, and Distributed Data ServicesPaola GrossoUniversity of AmsterdamInnovating the Network for Data Intensive Science (INDIS)Kenny GruchallaNational Renewable Energy LaboratoryContextual Compression of Large-Scale Wind 
Turbine Array SimulationsHui GuanNorth Carolina State UniversityEgeria: A Framework for Auto-Construction of HPC Advising Tools through Multi-Layered Natural Language ProcessingQiang GuanLos Alamos National LaboratoryUltrascale Systems Research CenterP92: Characterization and Comparison of Application Resilience for Serial and Parallel ExecutionsP53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaViewErnesto GuerreroUniversity of MalagaParallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACCShashank GugnaniOhio State UniversityA06: Accelerating Big Data Processing in the Cloud with Scalable Communication and I/O SchemesPablo GuillenUniversity of HoustonVistas in Advanced ComputingRaghul GunasekaranOak Ridge National LaboratoryScientific User Behavior and Data-Sharing Trends in a Petascale File SystemGUIDE: A Scalable Information Directory Service to Collect, Federate, and Analyze Logs for Operational Insights into a Leadership HPC FacilityMurat E. GuneyIntel CorporationDesigning Vector-Friendly Compact BLAS and LAPACK KernelsPeng GuoChinese Academy of SciencesTessellating StencilsXinfei GuoUniversity of VirginiaRE-HASE: Regular-Expressions Hardware Synthesis EngineXuan GuoOak Ridge National LaboratoryIntroduction - The Eighth International Workshop on Data-Intensive Computing in the CloudsYanfei GuoArgonne National LaboratoryWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1MPICH: A High-Performance Open-Source MPI ImplementationAnshul GuptaIBMIntroduction - Workshop on Education for High Performance Computing (EduHPC)Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsRajiv GuptaUniversity of California, RiversideParaStack: Efficient Hang Detection for MPI Programs at Large ScaleRavi GuptaIntel CorporationSnowpack: Efficient Parameter Choice for GPU Kernels via Static Analysis and Statistical PredictionSaurabh GuptaIntel CorporationFailures in Large Scale Systems: Long-Term Measurement, Analysis, and ImplicationsBlazingText: Scaling and Accelerating Word2Vec using Multiple GPUsLevent GurelUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsJohn L. GustafsonNational University of SingaporePosit ResearchPosit Math Unit (PMU) – A New Approach Toward Exascale ComputingImproving Numerical Computation with Practical Tools and Novel Computer ArithmeticEthan GutmannNational Center for Atmospheric ResearchPerformance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray FortranMarkus GötzResearch Center JuelichSupporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML Framework
H
Roland HaasNational Center for Supercomputing Applications, University of IllinoisP38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data ManagementSonja HabbingaForschungszentrum JuelichComprehensive Visualization of Large-Scale Simulation Data Linked to Respiratory Flow Computations on HPC SystemsSalman HabibArgonne National LaboratoryCosmological Particle Data Compression in PracticeElie HachemMines ParisTechSupercomputing for Everyone: Meeting the Growing Needs of BusinessesSebastian HackSaarland UniversityPACXXv2 + RV 
-- An LLVM-Based Portable High-Performance Programming ModelDaniel HackenbergTechnical University DresdenEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Towards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core ArchitecturesWalker HaddockUniversity of Alabama, BirminghamP45: Campaign Storage: Erasure Coding with GPUsBilel HadriKing Abdullah University of Science and TechnologyFourth SC Workshop on Best Practices for HPC TrainingEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Regression Testing and Monitoring ToolsState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing CentersMatthieu HaefeleFrench Alternative Energies and Atomic Energy CommissionP87: EoCoE Performance Benchmarking Methodology for Renewable Energy ApplicationsRaphael Tuvia HaftkaUniversity of FloridaMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignHans HagenUniversity of KaiserslauternCosmological Particle Data Compression in PracticeGeorg HagerUniversity of Erlangen-NurembergNode-Level Performance EngineeringChristoph HagleitnerIBMApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesGabriel HahnBaylor UniversityP35: Using HPC to Model Quantum-Dot Cellular AutomataAzzam HaidarUniversity of TennesseeInvestigating Half-Precision Arithmetic to Accelerate Dense Linear System SolversBatched, Reproducible, and Reduced Precision BLASMichael HaidlUniversity of MunsterPACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming ModelMahantesh HalappanavarPacific Northwest National LaboratoryHPC Graph Toolkits and the GraphBLAS ForumMary HallUniversity of UtahWriting Effective ProposalsBernd HamannUniversity of California, DavisScrubJay: Deriving Knowledge from the Disarray of HPC Performance DataKhaled HamidoucheAdvanced Micro Devices IncGPU 
Triggered Networking for Intra-Kernel CommunicationsDorit M. HammerlingNational Center for Atmospheric ResearchQuality Assurance and Error Identification for the Community Earth System ModelSimon D. HammondSandia National LaboratoriesIntroduction - The 8th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computer Systems (PMBS17)Designing Vector-Friendly Compact BLAS and LAPACK KernelsThe 8th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computer Systems (PMBS17)Sunggeun HanKorea Institute of Science and Technology InformationP51: TuPiX-Flow: Workflow-Based Large-Scale Scientific Data Analysis SystemToshihiro HanawaUniversity of TokyoState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)David HancockIndiana UniversityFuture Trends in HPCMatthew R. HanlonTexas Advanced Computing Center, University of TexasSecuring HPC: Development of a Low Cost, Open Source, Multi-Factor Authentication InfrastructureSean HanlonNational Cancer InstituteImpacting Cancer with HPC: Opportunities and ChallengesRiyaz HaqueLawrence Livermore National LaboratoryP79: Porting the Opacity Client Library to a CPU-GPU Cluster Using OpenMP 4.5Guénolé HarelAtomic Energy and Alternative Energies CommissionLean Visualization of Large Scale Tree-Based AMR MeshesSiva HariNvidia CorporationUnderstanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and ApplicationsKevin HarmsArgonne National LaboratoryRun-to-Run Variability on Xeon Phi Based Cray XC SystemsPeter Z. HarringtonUniversity of California, Santa CruzA16: Diagnosing Parallel I/O Bottlenecks in HPC ApplicationsJ. 
Austin HarrisOak Ridge National LaboratoryP26: Optimizing Gravity and Nuclear Physics in FLASH for ExascaleCyrus HarrisonLawrence Livermore National LaboratoryProjecting Performance Data Over Simulation Geometry Using SOSflow and AlpineScalable HPC Visualization and Data Analysis Using VisItWilliam HarrodUS Department of EnergyNational Strategic Computing InitiativePost Moore SupercomputingNational Strategic Computing Initiative UpdateRebecca Hartman-BakerLawrence Berkeley National LaboratoryFourth SC Workshop on Best Practices for HPC TrainingIntroduction - Women in HPC: Diversifying the HPC CommunityCareer Panel Discussion: Hints and Tips to Progress Your CareerEffective Workplace CommunicationHPC Software: Is “Cool Stuff” Really Incompatible with Sustainability?Women in HPC: Non-Traditional Paths to HPC and How They Can and Do Enrich the FieldMasayuki HatanakaRIKENWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1P48: Prototyping of Offloaded Persistent Broadcast on Tofu2 InterconnectAkihiro HayashiRice UniversityExploration of Supervised Machine Learning Techniques for Runtime Selection of CPU vs GPU Execution in Java ProgramsChapel-on-X: Exploring Tasking Runtimes for PGAS LanguagesLinda HaydenElizabeth City State UniversityTeaching, Learning and Collaborating through Cloud Computing Online ClassesBingsheng HeNational University of SingaporeEfficient Process Mapping in Geo-Distributed Cloud Data CentersConghui HeTsinghua University15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz ScenariosYun (Helen) HeLawrence Berkeley National LaboratoryOpenMP Common Core: A “Hands-On” ExplorationMario HeeneUniversity of StuttgartA Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma SimulationsSean HeftyIntel CorporationFabric APIs - libfabric User Perspective and C++ StandardizationAlexander HeineckeIntel CorporationP31: Understanding the Performance of Small Convolution 
Operations for CNN on Intel ArchitectureAlan HeirichStanford UniversitySLAC National Accelerator LaboratoryIn Situ Visualization with Task-Based ParallelismKatrin HeitmannArgonne National LaboratoryCosmological Particle Data Compression in PracticeStijn HeldensUniversity of TwenteP86: HyGraph: High Performance Graph Processing on Hybrid CPU+GPUs platformsBarbara HellandUS Department of EnergySmall Business and the Exascale Computing ProjectGreg HenryIntel CorporationP31: Understanding the Performance of Small Convolution Operations for CNN on Intel ArchitectureThomas HeraultUniversity of TennesseeDynamic Task Discovery in PaRSEC- A Data-Flow Task-Based RuntimeReliability, Fault Tolerance, and ResilienceRandy HerbanCycle ComputingHPC Systems Professionals WorkshopStephen HerbeinUniversity of DelawareP94: Fully Hierarchical Scheduling: Paving the Way to Exascale WorkloadsMartin HerbordtBoston UniversityOpenCL for FPGAs/HPC: Case Study in 3D FFTReconfigurable Supercomputing (RSC)P42: TRIP: An Ultra-Low Latency, TeraOps/s Reconfigurable Inference Processor for Multi-Layer PerceptronsOscar HernandezOak Ridge National LaboratoryOpenMP 4.5 Validation and Verification SuiteOpenSHMEM in the Era of ExascaleChristian HeroldTechnical University DresdenAn LLVM Instrumentation Plug-In for Score-PP67: Measuring I/O Behavior on Upcoming Systems with NVRAMP66: Analyzing Multi-Layer I/O Behavior of HPC ApplicationsMichael A. HerouxSandia National LaboratoriesSt. John’s UniversitySoftware Engineering for Computational Science and Engineering: What Can Work and What Will NotKeynote - A Holistic Approach to Advancing Science and Engineering through Extreme-Scale ComputingResearch MethodsLinear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and AcceleratorsBetter Scientific SoftwareSoftware Engineering and Reuse in Computational Science and EngineeringPractical Reproducibility by Managing Experiments Like SoftwareAngela M. 
HerringLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesEmily HerronMercer UniversityA12: Applying Image Feature Extraction to Cluttered Scientific RepositoriesAndreas HertenForschungszentrum JuelichApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesWilliam Judson HerveyNaval Research LaboratoryP18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial ConsortiaMary HesterSURFnetInnovating the Network for Data Intensive Science (INDIS)W. Terry HewittWTH Associates LtdHPC Acquisition and CommissioningJason HickLos Alamos National LaboratoryTotal Cost of Ownership and HPC System ProcurementSusan HicksOak Ridge National LaboratoryP59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing EnvironmentsMert HidayetogluUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsJan HiddersVrije Universiteit BrusselAdopting OpenCAPI for High Bandwidth Database AcceleratorsJoshua HigginsUniversity of HuddersfieldTeaching Parallel Computing with Container VirtualizationDean HildebrandGoogle LLCJoint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS)P58: Wharf: Sharing Docker Images across Hosts from a Distributed FilesystemUse and Management of Non-Volatile MemoriesPamela HillNational Center for Atmospheric ResearchBest Practices for Architecting Performance and Capacity in the Burst Buffer EraTorsten HoeflerETH ZurichIntroduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable LogicScaling Betweenness Centrality Using Communication-Efficient Sparse Matrix MultiplicationsPIN: High-Performance Streaming Processing in the NetworkPublishingAdvanced MPI Programming H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable LogicHenry 
HoffmannUniversity of ChicagoP98: Energy Efficiency in HPC with Machine Learning and Control TheorySteven HofmeyrLawrence Berkeley National LaboratoryP98: Energy Efficiency in HPC with Machine Learning and Control TheoryPeter HofsteeIBMAdopting OpenCAPI for High Bandwidth Database AcceleratorsAdolfy HoisiePacific Northwest National LaboratoryEnergy Efficient Supercomputing (E2SC)Representative Paths AnalysisEvaluating On-Node GPU Interconnects for Deep Learning WorkloadsJeffrey HokansonColorado School of MinesContemporary Design of Supercomputer ExperimentsJeffrey K. HollingsworthUniversity of MarylandAwards CeremonyDaniel HolmesUniversity of EdinburghIntroduction - Women in HPC: Diversifying the HPC CommunityEffective Programming Models for Deep Learning at ScaleVioleta HolmesUniversity of HuddersfieldTeaching Parallel Computing with Container VirtualizationHans-Christian HoppeIntel CorporationReconfigurable Computing in ExascaleMachine Learning for Parallel Performance AnalyticsReazul HoqueUniversity of TennesseeDynamic Task Discovery in PaRSEC- A Data-Flow Task-Based RuntimeAtsushi HoriRIKENP48: Prototyping of Offloaded Persistent Broadcast on Tofu2 InterconnectMuneo HoriUniversity of TokyoRIKENImplicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACCP09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element SimulationsP23: AI with Super-Computed Data for Monte Carlo Earthquake Hazard ClassificationTakane HoriJapan Agency for Marine-Earth Science and TechnologyP21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion ParticlesMasashi HorikoshiIntel CorporationP09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element SimulationsWilliam Connor HorneNaval Research LaboratoryP18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial ConsortiaNaomi HospodarskyUniversity of MinnesotaP60: 
Managing dbGaP Data with Stratus, a Research Cloud for Protected DataKaixi HouVirginia TechExploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific KernelsMike HoustonNvidia CorporationProduction Deep Learning and Scale - Keynote by Mike Houston - Senior Distinguished Engineer - Deep Learning - NvidiaEffective Programming Models for Deep Learning at ScalePaul HovlandArgonne National LaboratoryTowards Self-Verification in Finite Difference Code GenerationAwards CeremonyLouis HowellLawrence Livermore National LaboratoryPredicting the Performance Impact of Different Fat-Tree ConfigurationsKevin HsiehCarnegie Mellon UniversityToward Standardized Near-Data Processing with Unrestricted Data Placement for GPUsTony HsuInventec CorporationTowards a Composable Computer SystemYang HuUniversity of Texas, DallasLocoFS: A Loosely-Coupled Metadata Service for Distributed File SystemsJianyu HuangUniversity of TexasLowering Barriers into HPC through Open EducationP02: Strassen's Algorithm for Tensor ContractionKangli HuangDelft University of TechnologyAdopting OpenCAPI for High Bandwidth Database AcceleratorsShan HuangChinese Academy of SciencesTessellating StencilsXiaoping HuangNorthwestern Polytechnical UniversityRE-HASE: Regular-Expressions Hardware Synthesis EngineYingchao HuangUniversity of California, MercedUnimem: Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Main MemoryMartin Huarte-EspinosaUniversity of HoustonVistas in Advanced ComputingKevin A. 
HuckUniversity of OregonInteractive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel CodeJan HuckelheimImperial College, LondonVerifying the Floating-Point Computation Equivalence of Manually and Automatically Differentiated CodeTowards Self-Verification in Finite Difference Code GenerationYectli HuertaUniversity of MinnesotaP60: Managing dbGaP Data with Stratus, a Research Cloud for Protected DataRon HuizenBittWare IncCooling Hot FPGAs: A Thermals First ApproachSaurabh HukerikarOak Ridge National LaboratoryFive-minute presentations by young researchers from around the world - part 2Alan HumphreyUniversity of UtahScientific Computing and Imaging InstituteAddressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUsWen-Mei HwuUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsThomas HänerETH Zurich0.5 Petabyte Simulation of a 45-Qubit Quantum CircuitThomas HéraultUniversity of TennesseeFault-Tolerance for High Performance and Distributed Computing: Theory and PracticeMarkus HöhnerbachRWTH Aachen UniversityA04: Optimization of the AIREBO Many-Body Potential for KNL
I
Costin IancuLawrence Berkeley National LaboratoryIntroduction - PAW 2017: The 2nd Annual PGAS Applications WorkshopHuda IbeidUniversity of IllinoisP70: FFT, FMM, and Multigrid on the Road to Exascale: Performance Challenges and OpportunitiesTsuyoshi IchimuraUniversity of TokyoRIKENImplicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACCP09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element SimulationsP23: AI with Super-Computed Data for Monte Carlo Earthquake Hazard ClassificationAkihiro IdaUniversity of TokyoKeynote - Application Development Framework for Manycore Architectures on Post-Peta/Exascale SystemsYasuhiro IdomuraJapan Atomic Energy AgencyApplication of a 
Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore PlatformsP17: Fully Non-Blocking Communication-Computation Overlap Using Assistant Cores toward Exascale ComputingShuichi IharaDataDirect NetworksA Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File SystemAleksandar IlicINESC-IDModeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline ModelPerformance Tuning of Scientific Codes with the Roofline ModelAkira ImakuraUniversity of TsukubaEfficient and Scalable Calculation of Complex Band Structure Using Sakurai-Sugiura MethodToshiyuki ImamuraRIKENApplication of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore PlatformsConnor ImesUniversity of ChicagoP98: Energy Efficiency in HPC with Machine Learning and Control TheoryTakuya InaJapan Atomic Energy AgencyApplication of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore PlatformsMartins D. 
InnusUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisKoji InoueKyushu UniversityP78: Performance Evaluation of Graph500 Considering CPU-DRAM Power ShiftingJoseph InsleyArgonne National LaboratoryFlexible In Situ Visualization of LAMMPS SimulationsParallel Streaming for In Transit Analysis with Heterogeneous Data LayoutLarge Scale Visualization with ParaViewVisualizing Silicene Growth Through Island Migration and CoalescenceBertrand IoossEDF FranceMelissa: Large Scale In Transit Global Sensitivity Analysis Avoiding Intermediate FilesAlexandru IosupVrije Universiteit AmsterdamP86: HyGraph: High Performance Graph Processing on Hybrid CPU+GPUs platformsYutaka IshikawaRIKENP48: Prototyping of Offloaded Persistent Broadcast on Tofu2 InterconnectHPC Impact Showcase: Healthcare and ManufacturingYuki ItoTokyo Institute of TechnologyP05: ooc_cuDNN : A Deep Learning Library Supporting CNNs over GPU Memory CapacityShigeru IwaseUniversity of TsukubaEfficient and Scalable Calculation of Complex Band Structure Using Sakurai-Sugiura MethodHidetoshi IwashitaRIKENPreliminary Performance Evaluation of Coarray-based Implementation of Fiber Miniapp Suite Using XcalableMP PGAS Language
J
Christiane JablonowskiUniversity of MichiganParallel Computing 101Adrian JacksonUniversity of EdinburghIntroduction - Women in HPC: Diversifying the HPC CommunityAmina JacksonNaval Research LaboratoryP18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial ConsortiaArpith JacobIBMImplementing Implicit OpenMP Data Sharing on GPUsSam Ade JacobsLawrence Livermore National LaboratoryToward Scalable Parallel Training of Deep Neural NetworksHeike JagodeUniversity of TennesseeP72: New Developments for PAPI 5.6+Magnus JahreNorwegian University of Science and TechnologyToward Aggregated Grain GraphsNikhil JainLawrence Livermore National LaboratoryPerformance Modeling under Resource Constraints Using Deep 
Transfer LearningPredicting the Performance Impact of Different Fat-Tree ConfigurationsModeling and Simulation of Communication in HPC SystemsP75: Model-Agnostic Influence Analysis for Performance DataWilliam JALBYVersailles Saint-Quentin-en-Yvelines UniversityWorkshop on Extreme-Scale Programming Tools (ESPT)Siddhartha JanaIntel CorporationState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing CentersP95: GEOPM: A Scalable Open Runtime Framework for Power ManagementMatthias JanetschekUniversity of InnsbruckA Compiler Transformation-Based Approach to Scientific Workflow EnactmentDongmin JangKorea Institute of Science and Technology InformationVisualization of Decision-Making Support (DMS) Information for Responding to a Typhoon-Induced DisasterTomislav JanjusicMellanox TechnologiesWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Niclas JanssonKTH Royal Institute of TechnologyP24: A Deployment of HPC Algorithm into Pre/Post-Processing for Industrial CFD on K-ComputerJiri JarosBrno University of TechnologyP40: Running Large-Scale Ultrasound Simulations on Piz Daint with 512 Pascal GPUsStephen A. 
JarvisUniversity of WarwickIntroduction - The 8th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computer Systems (PMBS17)An Efficient Task-Based All-Reduce for Machine Learning ApplicationsThe 8th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computer Systems (PMBS17)Emmanuel JeannotFrench Institute for Research in Computer Science and Automation (INRIA)Modeling Large Compute Nodes with Heterogeneous Memories with the Cache-Aware Roofline ModelCross-Layer Allocation and Management of Hardware Resources in Shared Memory NodesElizabeth JessupUniversity of Colorado, BoulderCareers in HPCTest of Time Award Special LectureMorris JetteSchedMD LLCSlurm User Group MeetingXu JiTsinghua UniversityQatar Computing Research InstituteUnderstanding Object-Level Memory Access Patterns Across the SpectrumKenneth JiangNaval Research LaboratoryP18: A Parallel Python Implementation of BLAST+ (PPIB) for Characterization of Complex Microbial ConsortiaIvo JimenezUniversity of California, Santa CruzPractical Reproducibility by Managing Experiments Like SoftwareJudit JimenezBarcelona Supercomputing CenterWorkshop on Extreme-Scale Programming Tools (ESPT)Zheming JinArgonne National LaboratoryP46: Understanding How OpenCL Parameters Impact on Off-Chip Memory Performance of FPGA PlatformsMinsu JohKorea Institute of Science and Technology InformationVisualization of Decision-Making Support (DMS) Information for Responding to a Typhoon-Induced DisasterLizy K. 
JohnUniversity of TexasGPU Triggered Networking for Intra-Kernel CommunicationsChris JohnsonUniversity of UtahMedical Image Analysis and VisualizationExperiencing HPC for Undergraduates: Introduction to HPC ResearchTravis JohnstonOak Ridge National LaboratoryOptimizing Convolutional Neural Networks for Cloud DetectionAndrew JonesNumerical Algorithms GroupEssential HPC Finance Practice: Total Cost of Ownership (TCO), Internal Funding, and Cost-Recovery ModelsExtracting Value from HPC: Business Cases, Planning, and InvestmentHPC Acquisition and CommissioningCatherine JonesScience and Technology Facilities CouncilSoftware Engineers: Careers in ResearchMatthew D. JonesUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisKirk JordanIBMHartree CentreMaking HPC Consumable: Helping Wet-Lab Chemists Access the Power of Computational MethodsThomas H. JordanUniversity of Southern CaliforniarvGAHP – Push-Based Job Submission Using Reverse SSH ConnectionsJithin JoseIntel CorporationWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1Keyur JoshiUniversity of IllinoisImplementation of a Cache Miss Calculator in LLVM/PollySebastien JourdainKitware IncIn Situ Summarization with VTK-mGuido JuckelandHelmholtz-Zentrum Dresden-RossendorfIntroduction - Fourth Workshop on Accelerator Programming Using Directives (WACCPD)The OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactOpenACC API User Experience, Vendor Reaction, Relevance, and RoadmapFourth Workshop on Accelerator Programming Using Directives (WACCPD)Christoph JunghansLos Alamos National LaboratoryP20: Facilitating the Scalability of ParSplice for Exascale TestbedsGideon JuveInformation Sciences Institute, University of Southern CaliforniarvGAHP – Push-Based Job Submission Using Reverse SSH ConnectionsThierry JéronFrench Institute for Research in Computer Science and Automation (INRIA)Verifying MPI Applications with SimGridMC
K
Humayun KabirPennsylvania State UniversityHierarchical Sparse Graph Computations on Multicore PlatformsDavid KahanerAsian Technology Information ProgramWelcome and IntroductionATIP Workshop on International Exascale and Next-Generation Computing ProgramsBhavya KailkhuraLawrence Livermore National LaboratoryPerformance Modeling under Resource Constraints Using Deep Transfer LearningHartmut KaiserLouisiana State UniversityHPX Smart ExecutorsInteractive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel CodeJürgen KaiserJohannes Gutenberg University MainzA Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File SystemYuta KakibukaKyushu UniversityP78: Performance Evaluation of Graph500 Considering CPU-DRAM Power ShiftingLaxmikant KaleUniversity of IllinoisVisualizing, Measuring, and Tuning Adaptive MPI ParametersIntegrating OpenMP into the Charm++ Programming ModelMigratable Objects and Task-Based Parallel Programming with Charm++Vivek 
KaleUniversity of Southern CaliforniaP80: Adaptive Loop Scheduling with Charm++ to Improve Performance of Scientific ApplicationsDhiraj KalmakarIntel CorporationP31: Understanding the Performance of Small Convolution Operations for CNN on Intel ArchitectureSharan KalwaniDataSwingData Center Design and Planning for HPC FolksLaxmikant KaléUniversity of IllinoisCharmworks IncCharm++ and AMPI: Adaptive and Asynchronous Parallel ProgrammingYasushi KamataRailway Technical Research InstituteP22: Numerical Simulation of Snow Accretion by Airflow Simulator and Particle SimulatorSupun KamburugamuveIndiana UniversityTeaching, Learning and Collaborating through Cloud Computing Online ClassesShoaib KamilAdobe ResearchSympiler: Transforming Sparse Matrix Codes by Decoupling Symbolic AnalysisKrishna KantTemple UniversityRevisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsSilent Errors in HPC SystemsLarry KaplanCray IncThe ARM Software Ecosystem: Are We There Yet?Karen KaravanicPortland State UniversityRevisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsIan KarlinLawrence Livermore National LaboratoryPredicting the Performance Impact of Different Fat-Tree ConfigurationsDataRaceBench: A Benchmark Suite for Systematic Evaluation of Data Race Detection ToolsP76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for KernelsSven KarlssonTechnical University of DenmarkReconfigurable Computing in ExascaleYoshihiro KasaiFujitsu LtdP17: Fully Non-Blocking Communication-Computation Overlap Using Assistant Cores toward Exascale ComputingJulian Kates-HarbeckHarvard UniversityTraining Distributed Deep Recurrent Neural Networks with Mixed Precision on GPU ClustersDaniel S. 
KatzUniversity of IllinoisIntroduction - The 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)Promoting Scientific WorkflowsExperiencing HPC for Undergraduates: Careers in HPCSoftware Engineering and Reuse in Computational Science and EngineeringSoftware Engineers: Careers in ResearchHigh Performance Computing Education in US Data ScienceState of the Practice: OperationsThe 2017 International Workshop on Software Engineering for High Performance Computing in Computational and Data-Enabled Science and Engineering (SE-CoDeSE 2017)Masatoshi KawaiUniversity of TokyoKeynote - Application Development Framework for Manycore Architectures on Post-Peta/Exascale SystemsPetr KaštovskýNetcope TechnologiesCase Study: Usage of High Level Synthesis in HPC NetworkingKate KeaheyArgonne National LaboratoryPractical Reproducibility by Managing Experiments Like SoftwareDistributed Computing and CloudsStephen KecklerNvidia CorporationUnderstanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and ApplicationsKristopher KeipertIowa State UniversityPorting a GAMESS Computational Chemistry Kernel to FPGAsAn Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi ProcessorP30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon PhiAnna Blue KeleherUniversity of MarylandA11: Finding a Needle in a Field of Haystacks: Lightweight Metadata Search for Large-Scale Distributed Research RepositoriesAlison KennedyHartree CentreIntroduction - Women in HPC: Diversifying the HPC CommunityGarrett KenyonLos Alamos National LaboratoryNew Mexico ConsortiumP88: PetaVision Neural Simulation Toolbox on Intel KNLsDarren KerbysonPacific Northwest National LaboratoryRepresentative Paths AnalysisDan KernsQuantum CorporationRook Distributed Storage SystemJanis KeuperFraunhofer 
Institute for Industrial MathematicsTensorQuant - A Simulation Toolbox for Deep Neural Network QuantizationIntroduction - Machine Learning in HPC EnvironmentsDavid KeyesKing Abdullah University of Science and TechnologyKAUST’s HiCMA Library: Hierarchical Computations on Manycore ArchitecturesHow Serious Are We About the Convergence Between HPC and Big Data?Software Engineering and Reuse in Computational Science and EngineeringWalid KeyrouzNational Institute for Standards and TechnologyComputational Reproducibility at Exascale 2017 (CRE2017)Zahra KhatamiLouisiana State UniversityHPX Smart ExecutorsGul rukh KhattakCERNP29: A Deep Learning Tool for Fast SimulationFarzad KhorasaniGeorgia Institute of TechnologyEnabling Work-Efficiency for High Performance Vertex-Centric Graph Analytics on GPUsS. E. KhudikyanJet Propulsion Laboratoryrealfast@VLASamuel KhuvisParaToolsP04: Unstructured-Grid CFD Algorithms on Many-Core ArchitecturesJohn KichuryHewlett Packard EnterpriseHPC in Space: Supercomputing at 17,500 MPHRon KikinisHarvard UniversityMedical Image Analysis and VisualizationEugene KikinzonLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesWilliam KillianMillersville University of PennsylvaniaUniversity of DelawareThe Design and Implementation of OpenMP 4.5 and OpenACC Backends for the RAJA C++ Performance Portability LayerP76: A Compiler Agnostic and Architecture Aware Predictive Modeling Framework for KernelsGwangsun KimARM LtdToward Standardized Near-Data Processing with Unrestricted Data Placement for GPUsHyunwoo KimKorea Institute of Science and Technology InformationP51: TuPiX-Flow: Workflow-Based Large-Scale Scientific Data Analysis SystemJeongnim KimIntel CorporationEmbracing a New Era of Highly Efficient and Productive Quantum Monte Carlo SimulationsJungwon KimOak Ridge National LaboratoryPapyrusKV: A High-Performance Parallel Key-Value Store for Distributed NVM ArchitecturesKyungjoo KimSandia 
National LaboratoriesDesigning Vector-Friendly Compact BLAS and LAPACK KernelsMark KimOak Ridge National LaboratoryIn Situ Visualization of Radiation Transport GeometryNam Ho KimUniversity of FloridaMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignYoungjae KimSogang UniversityUnderstanding Object-Level Memory Access Patterns Across the SpectrumTagIt: An Integrated Indexing and Search Service for File SystemsJason S. KimkoCollege of William and MaryP79: Porting the Opacity Client Library to a CPU-GPU Cluster Using OpenMP 4.5Tom KingQueen Mary University of LondonOpenStack For HPC: Best Practices for Optimizing Software-Defined InfrastructureAnantha P. KinnalCalligo TechnologiesPosit ResearchPosit Math Unit (PMU) – A New Approach Toward Exascale ComputingAndrew C. KirbyUniversity of WyomingP28: High-Fidelity Blade-Resolved Wind Plant ModelingChristine KirkpatrickSan Diego Supercomputer CenterVirtualization Ecosystems – Supporting Increasingly Complex Scientific ApplicationsSherman J. KisnerHigh Performance Imaging LLCMassively Parallel 3D Image ReconstructionKevin D. 
KissellGoogleDeep Learning for Science in the CloudPer Gunnar KjeldsbergNorwegian University of Science and TechnologyTowards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core ArchitecturesUmayanganie KlaassenUniversity of Texas, El PasoPorting a GAMESS Computational Chemistry Kernel to FPGAsScott KlaskyOak Ridge National LaboratoryIntroduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)MGARD: A Multilevel Technique for Compression of Floating-Point DataOptimizing the Query Performance of Block Index Through Data Analysis and I/O ModelingThe 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)Kerstin Kleese van DamBrookhaven National LaboratoryHPC Application Development ToolsLars KleinUniversity of MunsterPACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming ModelRomain KleinTransvalor SAAerominesSupercomputing for Everyone: Meeting the Growing Needs of BusinessesMichael KlemmIntel CorporationAdvanced OpenMP: Performance and 4.5 FeaturesMastering Tasking with OpenMPOpenMP: Enabling HPC for Twenty YearsOpenMP® is Twenty. 
Where Is It Going?Alicia KlinvexSandia National LaboratoriesBetter Scientific SoftwareHannah KlionUniversity of California, BerkeleyOak Ridge National LaboratoryP26: Optimizing Gravity and Nuclear Physics in FLASH for ExascaleSarah KnepperIntel CorporationDesigning Vector-Friendly Compact BLAS and LAPACK KernelsChristopher KnightArgonne National LaboratoryScalable In Situ Analysis of Molecular Dynamics SimulationsMatthew KnightMetamako LPA Networked-FPGA Platform Offering Flexible Ethernet Switching from Layer 1 All the Way to Full SDN via P4Aaron KnollUniversity of UtahFlexible In Situ Visualization of LAMMPS SimulationsAndreas KnuepferTechnical University DresdenPerformance Evaluation Tools Christina KochUniversity of WisconsinHPC Carpentry - Practical, Hands-On HPC TrainingGreg KoenigEnergy Efficient HPC Working GroupState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)Greg KoenigKPMGEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing CentersPeter KoggeUniversity of Notre Dame15th Graph500 ListMemory-Centric Architectures for the Cloud and HPCA Case for Migrating Execution for Irregular ApplicationsHidetaka KoieNational Institute of Advanced Industrial Science and TechnologyP97: Profile Guided Kernel Optimization for Individual Container Execution on Bare-Metal ContainerBastian KollerHigh Performance Computing Center StuttgartHow Serious Are We About the Convergence Between HPC and Big Data?Martin KongBrookhaven National LaboratoryBenchmarking and Evaluating Unified Memory for OpenMP GPU OffloadingAlice KonigesLawrence Berkeley National LaboratoryOpenMP Common Core: A “Hands-On” ExplorationInteractive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel CodeRob KooperUniversity of IllinoisVirtualization Ecosystems – Supporting Increasingly Complex Scientific 
ApplicationsIsrael KorenUniversity of MassachusettsExperimental and Analytical Study of Xeon Phi ReliabilityTuomas S. KoskelaLawrence Berkeley National LaboratoryPerformance Tuning of Scientific Codes with the Roofline ModelDouglas KotheOak Ridge National LaboratoryExascale Challenges and OpportunitiesAnthony KougkasIllinois Institute of TechnologyEnosis: Bridging the Semantic Gap between File-Based and Object-Based Data ModelsSpiros KoulouzisUniversity of AmsterdamSeamless Infrastructure Customization and Performance Optimization for Time-Critical Services in Data InfrastructuresJelena KovacevicCarnegie Mellon UniversityP06: Large Scale FFT-Based Stress-Strain Simulations with Irregular Domain DecompositionPatricia KovatchIcahn School of Medicine at Mount SinaiMedical Image Analysis and VisualizationComputational Approaches for CancerImpacting Cancer with HPC: Opportunities and ChallengesQuincey KoziolLawrence Berkeley National LaboratoryThe HDF5 DataverseCassie KozyrkovGoogleDeep Learning for Science in the CloudMatthew KrafczykNational Center for Supercomputing Applications, University of IllinoisP91: Assessing the Availability of Source Code in Computational PhysicsReproducibility and Uncertainty in High Performance ComputingDieter KranzlmuellerLudwig Maximilian University of MunichPower-Aware High Performance Computing: Challenges and Opportunities for Application and System DevelopersJiri KrausNvidia CorporationApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesMichael KrauseGen-Z ConsortiumUnderstanding Gen-Z Technology – A High Performance Interconnect for the Data-Centric FutureMichal KravcenkoTechnical University of OstravaP03: BEM4I: A Massively Parallel Boundary Element SolverRebecca KreitingerUniversity of New MexicoP93: Spacehog: Evaluating the Costs of Dedicating Resources to In Situ AnalysisHamid KrimNorth Carolina State UniversityEgeria: A Framework for Auto-Construction of HPC Advising Tools through Multi-Layered Natural 
Language ProcessingSriram KrishnamoorthyPacific Northwest National LaboratoryWOLFHPC: Workshop on Domain-Specific Languages and High-Level Frameworks for High-Performance ComputingAutomatic Risk-Based Selective Redundancy for Fault-Tolerant Task-Parallel HPC ApplicationsSilent Errors in HPC SystemsP84: PRESAGE: Selective Low Overhead Error Amplification for Easy DetectionMads R. B. KristensenUniversity of CopenhagenExploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific KernelsMartin KronbichlerTechnical University MunichP08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.IIMark R. KrumholzAustralian National UniversityUniversity of California, Santa CruzMilky Way Analogue Isolated Disk GalaxyJeff KuehnLos Alamos National LaboratoryOpenSHMEM in the Era of ExascaleMichael KuhnUniversity of HamburgP57: Adaptive Tier Selection for NetCDF and HDF5Navjot KukrejaImperial College, LondonTowards Self-Verification in Finite Difference Code GenerationAnuva KulkarniCarnegie Mellon UniversityP06: Large Scale FFT-Based Stress-Strain Simulations with Irregular Domain DecompositionAbhishek KumarBrookhaven National LaboratoryA19: Performance Analysis of a Parallelized Restricted Boltzmann Machine Artificial Neural Network Using OpenACC Framework and TAU Profiling SystemNalini KumarUniversity of FloridaMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignRick KumarSanmina CorporationBuilding End-to-End NVMe over Fabric Infrastructure for HPCVipin KumarUniversity of MinnesotaCommon Big Data Challenges in Bio, Geo, Climate, and Social SciencesKalyan KumaranArgonne National LaboratoryRun-to-Run Variability on Xeon Phi Based Cray XC SystemsJulian KunkelGerman Climate Computing CenterAnalyzing Parallel I/OThe Virtual Institute of I/O and the IO-500P57: Adaptive Tier Selection for NetCDF and HDF5P15: Toward Decoupling the Selection of Compression Algorithms from Quality ConstraintsShannon KuntzEmu Solutions 
Inc.A Case for Migrating Execution for Irregular ApplicationsThorsten KurthLawrence Berkeley National LaboratoryDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataGregory KurtzerSingularityWare LLCContainers in HPCJakub KurzakUniversity of TennesseeLinear Algebra Libraries for High-Performance Computing: Scientific Computing with Multicore and Accelerators
L
Jesus LabartaBarcelona Supercomputing CenterKennedy Award Presentation: The Real Revolution … from the Latency to the Throughput AgeIgnacio LagunaLawrence Livermore National LaboratorySnowpack: Efficient Parameter Choice for GPU Kernels via Static Analysis and Statistical PredictionIntroduction - 1st International Workshop on Software Correctness for HPC Applications (Correctness 2017)REFINE: Realistic Fault Injection via Compiler-Based Instrumentation for Accuracy, Portability and SpeedCorrectness 2017: First International Workshop on Software Correctness for HPC ApplicationsKartik LakhotiaUniversity of Southern CaliforniaFive-minute presentations by young researchers from around the world - part 1Maddegedara LalithUniversity of TokyoRIKENP23: AI with Super-Computed Data for Monte Carlo Earthquake Hazard ClassificationDebra LamManaging Director for Smart Cities & Inclusive InnovationGeorgia Institute of TechnologyHPC Connects Plenary: The Century of the CityHerman LamUniversity of FloridaA FPGA-Pipelined Approach for Accelerated Discrete-Event Simulation of HPC SystemsMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignReconfigurable Supercomputing (RSC)Michael LamJames Madison UniversityLawrence Livermore National LaboratoryImproving Numerical Computation with Practical Tools and Novel Computer ArithmeticSandy LandsbergUS Department of Defense HPC Modernization ProgramBlurring the Lines: High-End Computing and Data ScienceJonas L. 
LandsgesellUniversity of StuttgartP20: Facilitating the Scalability of ParSplice for Exascale TestbedsJoshua LandwehrPacific Northwest National LaboratoryVerification of the Extended Roofline Model for Asynchronous Many Task RuntimesP99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big DataMichael LangLos Alamos National LaboratoryNUMA Distance for Heterogeneous MemoryModeling UGAL on the Dragonfly TopologyA Comparative Study of SDN and Adaptive Routing on Dragonfly NetworksP55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage ReliabilityMichael LangeImperial College, LondonTowards Self-Verification in Finite Difference Code GenerationUlrich LangenbachBeuth University of Applied Sciences BerlinHeterogeneous Multi-Processing in Software-Defined Cloud Storage NodesAkhil LangerIntel CorporationWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Johannes LangguthSimula Research LaboratoryTowards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core ArchitecturesJames H. LarosSandia National LaboratoriesEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Total Cost of Ownership and HPC System ProcurementPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlMatthew LarsenLawrence Livermore National LaboratoryProjecting Performance Data Over Simulation Geometry Using SOSflow and AlpineThe ALPINE In Situ Infrastructure: Ascending from the Ashes of StrawmanRob LathamArgonne National LaboratoryParallel I/O in PracticeScott LathropUniversity of IllinoisFourth SC Workshop on Best Practices for HPC TrainingPromoting Scientific WorkflowsA Proposed Model for Teaching Advanced Parallel Computing and Related TopicsSoftware Engineering and Reuse in Computational Science and EngineeringDairsie LatimerRed Oak ConsultingHPC Software: Is “Cool Stuff” Really Incompatible with Sustainability?Casey J. 
LawUniversity of California, Berkeleyrealfast@VLAT. Joseph W. LazioJet Propulsion Laboratoryrealfast@VLAValentin Le FèvreENS LyonPeriodic I/O Scheduling for SupercomputersElizabeth LeakeSTEM-TrekSpecial Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)Christopher LeapUniversity of New MexicoP47: Understanding Congestion on Omni-Path FabricsMichael LeBeaneUniversity of TexasAdvanced Micro Devices IncGPU Triggered Networking for Intra-Kernel CommunicationsAnton LebedevUniversity of TubingenInvited Talk - On Improved Monte Carlo Hybrid Methods for Preconditioner ComputationsYouenn LebrasUniversity of VersaillesFive-minute presentations by young researchers from around the world - part 2Gregory L. LeeLawrence Livermore National LaboratoryManaging HPC Software Complexity with SpackHyungro LeeIndiana UniversityTeaching, Learning and Collaborating through Cloud Computing Online ClassesJinho LeeIBMAdopting OpenCAPI for High Bandwidth Database AcceleratorsJunKyu LeeQueen's University BelfastP11: Energy-Efficient Transprecision Techniques for Iterative RefinementMatthew LeeCarnegie Mellon UniversityA Family of Provably Correct Algorithms for Exact Triangle CountingSeyong LeeOak Ridge National LaboratoryPorting a GAMESS Computational Chemistry Kernel to FPGAsPapyrusKV: A High-Performance Parallel Key-Value Store for Distributed NVM ArchitecturesWonchan LeeStanford UniversityControl Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical RegionsMiriam LeeserNortheastern UniversityComputational Reproducibility at Exascale 2017 (CRE2017)Reproducibility and Uncertainty in High Performance ComputingMatthew LegendreLawrence Livermore National LaboratoryManaging HPC Software Complexity with SpackJoshua LeibfriedUniversity of MinnesotaP60: Managing dbGaP Data with Stratus, a Research Cloud for Protected DataJohn D. 
LeidelTactical Computing LaboratoriesWorkshop for Open Source SupercomputingPressure-Driven Hardware Managed Thread Concurrency for Irregular ApplicationsBit Contiguous Memory Allocation for Processing In MemoryReconfigurable Computing in ExascaleJason LeighUniversity of Hawaii, ManoaSAGE2 9th Annual International SC BOF: Scalable Amplified Group Environment for Global CollaborationMatthew LeiningerLawrence Livermore National LaboratoryPredicting the Performance Impact of Different Fat-Tree ConfigurationsJacques Bernard LEKIENAtomic Energy and Alternative Energies CommissionLean Visualization of Large Scale Tree-Based AMR MeshesBryce A. LelbachLawrence Berkeley National LaboratoryInteractive HPC: Using C++ and HPX Inside Jupyterhub to Write Performant Portable Parallel CodeEdgar A. LeonLawrence Livermore National LaboratoryPredicting the Performance Impact of Different Fat-Tree ConfigurationsSiew Hoon LeongNational Supercomputing Center SingaporePosit ResearchPosit Math Unit (PMU) – A New Approach Toward Exascale ComputingRichard LethinReservoir Labs IncYale UniversitySmall Business and the Exascale Computing ProjectMary Ann LeungSustainable Horizons InstituteForming Connections I: Connecting Sideways, with Ourselves and Our PeersRandall LeVequeUniversity of WashingtonSoftware Engineering and Reuse in Computational Science and EngineeringJohn LevesqueCray IncFortran Is 60 Years Old - Has It Changed for the Better?Joshua A. LevineUniversity of ArizonaIntroduction - 4th International Workshop on Visual Performance Analytics – VPA 2017Panel Discussion: Challenges and the Future of HPC Performance VisualizationFourth International Workshop on Visual Performance Analysis – VPA 2017James LevittUniversity of TexasGeometry-Oblivious FMM for Compressing Dense SPD MatricesScott LevySandia National LaboratoriesP93: Spacehog: Evaluating the Costs of Dedicating Resources to In Situ AnalysisStuart A. 
LevyNational Center for Supercomputing Applications, University of IllinoisMilky Way Analogue Isolated Disk GalaxyFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseAng LiPacific Northwest National LaboratoryExploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific KernelsChung-Gang LiKobe UniversityRIKENP24: A Deployment of HPC Algorithm into Pre/Post-Processing for Industrial CFD on K-ComputerDong LiUniversity of California, MercedUnimem: Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Main MemoryP92: Characterization and Comparison of Application Resilience for Serial and Parallel ExecutionsGuanpeng LiUniversity of British ColumbiaUnderstanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and ApplicationsHongbo LiUniversity of California, RiversideCorrecting Soft Errors Online in Fast Fourier TransformParaStack: Efficient Hang Detection for MPI Programs at Large ScaleJiajia LiGeorgia Institute of TechnologyFive-minute presentations by young researchers from around the world - part 1P10: HiCOO: A Hierarchical Sparse Tensor Format for Tensor DecompositionsJiang LiSun Yat-Sen UniversityVisualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model SystemLingda LiBrookhaven National LaboratoryDeveloping an OpenMP Runtime for UVM-Capable GPUsBenchmarking and Evaluating Unified Memory for OpenMP GPU OffloadingShaomeng LiNational Center for Atmospheric ResearchUniversity of OregonPerformance Impacts of In Situ Wavelet Compression on Scientific SimulationsSihuan LiUniversity of California, RiversideCorrecting Soft Errors Online in Fast Fourier TransformSiyang LiTsinghua UniversityLocoFS: A Loosely-Coupled Metadata Service for Distributed File SystemsTao LiUniversity of FloridaLocoFS: A Loosely-Coupled Metadata Service for Distributed File SystemsTiffany LiNational Center for Supercomputing 
Applications, University of IllinoisP38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data ManagementTonglin LiOak Ridge National LaboratoryIntroduction - The Eighth International Workshop on Data-Intensive Computing in the CloudsWeijun LiShenzhen DAPU Microelectronics CompanyIntroducing DPU - Data-Storage Processing Unit – Placing Intelligence in StorageXi LiDataDirect NetworksA Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File SystemYan LiUniversity of California, Santa CruzCAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement LearningP65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement LearningZhenyu LiUniversity of WarwickAn Efficient Task-Based All-Reduce for Machine Learning ApplicationsXin LiangUniversity of California, RiversideCorrecting Soft Errors Online in Fast Fourier TransformP53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaViewYishuang LiangBeijing Normal UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLightChunhua LiaoLawrence Livermore National LaboratoryDataRaceBench: A Benchmark Suite for Systematic Evaluation of Data Race Detection ToolsJunfeng LiaoTsinghua UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLightHyun LimBrigham Young UniversityP63: FleCSPH: a Parallel and Distributed Smoothed Particle Hydrodynamics Framework Based on FleCSISeung-Hwan LimOak Ridge National LaboratoryScientific User Behavior and Data-Sharing Trends in a Petascale File SystemTagIt: An Integrated Indexing and Search Service for File SystemsJames LinShanghai Jiao Tong UniversitySoftware Engineering and Reuse in Computational Science and EngineeringJin LinIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationLan LinBall State 
UniversityDesigning a Synchronization-Reducing Clustering Method on Manycores: Some Issues and ImprovementsMeifeng LinBrookhaven National LaboratoryThe OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactPei-Hung LinLawrence Livermore National LaboratoryVerifying the Floating-Point Computation Equivalence of Manually and Automatically Differentiated CodeDataRaceBench: A Benchmark Suite for Systematic Evaluation of Data Race Detection ToolsIris LinckUniversity of Colorado, DenverP20: Facilitating the Scalability of ParSplice for Exascale TestbedsPeggy LindnerUniversity of HoustonVistas in Advanced ComputingPeter LindstromLawrence Livermore National LaboratoryCompression of Scientific DataJohn C. LinfordParaToolsP04: Unstructured-Grid CFD Algorithms on Many-Core ArchitecturesHierarchical Memory Usage Andreas LintermannRWTH Aachen UniversityJuelich Aachen Research AllianceComprehensive Visualization of Large-Scale Simulation Data Linked to Respiratory Flow Computations on HPC SystemsDon LipariLawrence Livermore National LaboratoryP94: Fully Hierarchical Scheduling: Paving the Way to Exascale WorkloadsGengchen LiuUniversity of California, DavisSilicon Photonic LIONS: All-to-All Interconnects for Energy-Efficient, Scalable, and Modular HPC SystemsQing Gary LiuNew Jersey Institute of TechnologyIntroduction - The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)The 2nd International Workshop on Data Reduction for Big Scientific Data (DRBSD-2)Si LiuUniversity of TexasAdvanced Manycore Programming (KNL)Weifeng LiuUniversity of CopenhagenNorwegian University of Science and TechnologyExploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific KernelsWeiguo LiuShandong UniversityRedesigning CAM-SE for Petascale Climate Modeling Performance on Sunway TaihuLight15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight: Enabling Depiction of Realistic 10 Hz 
ScenariosXinlian LiuLawrence Berkeley National LaboratoryP36: A Novel Feature-Preserving Spatial Mapping for Deep Learning Classification of Ras StructuresYan LiuUniversity of MaineP72: New Developments for PAPI 5.6+P33: Massively Parallel Evolutionary Computation for Empowering Electoral Reform: Quantifying Gerrymandering via Multi-objective Optimization and Statistical AnalysisYuanlai LiuUniversity of California, RiversideCorrecting Soft Errors Online in Fast Fourier TransformKarl LjungkvistUppsala UniversityP08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.IILi-Ta LoLos Alamos National LaboratoryP53: TensorViz: Visualizing the Training of Convolutional Neural Network Using ParaViewJay LofsteadSandia National LaboratoriesSupercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and OrganizationsThe Virtual Institute of I/O and the IO-500Practical Reproducibility by Managing Experiments Like SoftwareBruce LoftisIndependentStudents@SC17 Welcome and Opening SessionGabriel H. LohAdvanced Micro Devices IncLeveraging Near Data Processing for High-Performance Checkpoint/RestartJulien LoiseauUniversity of Reims Champagne-ArdenneP63: FleCSPH: a Parallel and Distributed Smoothed Particle Hydrodynamics Framework Based on FleCSIJosip LoncaricLos Alamos National LaboratoryEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)Bill LongCray IncIntroduction - PAW 2017: The 2nd Annual PGAS Applications WorkshopDarrell D. E. 
LongUniversity of California, Santa CruzCAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement LearningP65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement LearningGuy LonsdaleScaposHPC Software: Is “Cool Stuff” Really Incompatible with Sustainability?Patty LopezIntel CorporationBuilding Your Professional PersonaBuilding Your Professional PersonaFrancesc-Josep LordanBarcelona Supercomputing CenterPolytechnic University of CataloniaEnabling GPU Support for the COMPSs-Mobile FrameworkBurlen LoringLawrence Berkeley National LaboratoryIn Situ Analysis and Visualization with SENSEIDominik Marek LorochFraunhofer Institute for Industrial MathematicsTensorQuant - A Simulation Toolbox for Deep Neural Network QuantizationGerald LottoMellanox TechnologiesInterconnect Your Future with Mellanox “Smart” InterconnectSteven G. LouieLawrence Berkeley National LaboratoryUniversity of California, BerkeleyP13: Large-Scale GW Calculations on Pre-Exascale HPC SystemsTze Meng LowCarnegie Mellon UniversityA Family of Provably Correct Algorithms for Exact Triangle CountingMike LoweIndiana UniversityOpenStack For HPC: Best Practices for Optimizing Software-Defined InfrastructureDavid LowenthalUniversity of ArizonaEnergy Efficient Supercomputing (E2SC)Power-Aware High Performance Computing: Challenges and Opportunities for Application and System DevelopersHatem LtaiefKing Abdullah University of Science and TechnologyHow Serious Are We About the Convergence Between HPC and Big Data?Hao LuOak Ridge National LaboratorySpherical Region Queries on Multicore ArchitecturesXiaoyi LuOhio State UniversityBig Data Meets HPC: Exploiting HPC Technologies for Accelerating Big Data Processing and ManagementBuilding Efficient Clouds for HPC, Big Data, and Deep Learning Middleware and ApplicationsAccelerating Big Data Processing and Machine/Deep Learning Middleware on Modern HPC ClustersScalable Reduction Collectives with Data 
Partitioning-Based Multi-Leader DesignYouyou LuTsinghua UniversityLocoFS: A Loosely-Coupled Metadata Service for Distributed File SystemsYutong LuSun Yat-Sen UniversityVisualizations of a High-Resolution Global-Regional Nested, Ice-Sea-Wave Coupled Ocean Model SystemRobert F. LucasUniversity of Southern CaliforniaInvited Talks 1Juan Lucio-VegaUniversity of DelawareThe OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactSebastian LuehrsForschungszentrum JuelichJuelich Supercomputing CenterP87: EoCoE Performance Benchmarking Methodology for Renewable Energy ApplicationsJakob LuettgauGerman Climate Computing CenterP57: Adaptive Tier Selection for NetCDF and HDF5Hui LuiUniversity of IllinoisSimulation and Visual Representation of Tropical Cyclone-Ocean InteractionsRonald Peter LuijtenIBMDOME Hot-Water Cooled MicroDataCenterJustin P. LuitjensNvidia CorporationP04: Unstructured-Grid CFD Algorithms on Many-Core ArchitecturesAndrew LumsdainePacific Northwest National LaboratoryUniversity of WashingtonThinking Strategically15th Graph500 ListElias LundmarkUniversity West SwedenP44: Increasing Throughput of Multiprogram HPC Workloads: Evaluating a SMT Co-Scheduling ApproachThomas LundqvistUniversity West SwedenP44: Increasing Throughput of Multiprogram HPC Workloads: Evaluating a SMT Co-Scheduling ApproachXi LuoUniversity of TennesseeData Analysis of Earth System Simulation within an In Situ InfrastructureYe LuoArgonne National LaboratoryEmbracing a New Era of Highly Efficient and Productive Quantum Monte Carlo SimulationsYingyi LuoNorthwestern UniversityP46: Understanding How OpenCL Parameters Impact on Off-Chip Memory Performance of FPGA PlatformsZiqing LuoUniversity of DelawareTowards Self-Verification in Finite Difference Code GenerationP83: Contracts for Message-Passing ProgramsFabio LuporiniImperial College, LondonTowards Self-Verification in Finite Difference Code GenerationPiotr LuszczekUniversity of TennesseeBatched, 
Reproducible, and Reduced Precision BLASBenjamin LynchUniversity of MinnesotaOpenStack For HPC: Best Practices for Optimizing Software-Defined InfrastructureCeph Applications in HPC EnvironmentsP60: Managing dbGaP Data with Stratus, a Research Cloud for Protected DataGeoff LyonCoolIT Systems IncChip-to-Atmosphere: Providing Safe and Effective Cooling for High-Density, High-Performance Data Center EnvironmentsMichael LysaghtIrish Centre for High End ComputingIntroduction - H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic H2RC: Third International Workshop on Heterogeneous Computing with Reconfigurable Logic
M
Prabhat MLawrence Berkeley National LaboratoryEffective Programming Models for Deep Learning at ScaleXiaosong MaQatar Computing Research InstituteUnderstanding Object-Level Memory Access Patterns Across the SpectrumBarney MaccabeOak Ridge National LaboratoryPerformance, Advancement, and PromotionsLalith MaddegedaraUniversity of TokyoRIKENImplicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACCP09: Adaptive Multistep Predictor for Accelerating Dynamic Implicit Finite-Element SimulationsElizabeth H. MaddenLudwig Maximilian University of MunichExtreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust EarthquakeKavitha MadhuArgonne National LaboratoryMPICH: A High-Performance Open-Source MPI ImplementationPhilip J. MaechlingUniversity of Southern CaliforniarvGAHP – Push-Based Job Submission Using Reverse SSH ConnectionsShinya MaeyamaNagoya UniversityP17: Fully Non-Blocking Communication-Computation Overlap Using Assistant Cores toward Exascale ComputingDonald MaghrakKrell InstituteHow To Analyze the Performance of Parallel Codes 101Tom MaidenPittsburgh Supercomputing CenterFrom Outreach to Education to Researcher - Innovative Ways of Expanding the HPC CommunityLiudmila S. 
MainzerNational Center for Supercomputing Applications, University of IllinoisP38: Benchmarking Parallelized File Aggregation Tools for Large Scale Data ManagementMatthias MaiterthIntel CorporationState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing CentersPreeti MalakarArgonne National LaboratoryScalable In Situ Analysis of Molecular Dynamics SimulationsTareq MalasLawrence Berkeley National LaboratoryDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataAbdul N. Malmi-KakkadaUniversity of TexasPhysical Signatures of Cancer MetastasisChris M. MaloneLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesAllen D. MalonyUniversity of OregonWorkshop on Extreme-Scale Programming Tools (ESPT)Projecting Performance Data Over Simulation Geometry Using SOSflow and AlpinePerformance TuningCarlos MaltzahnUniversity of California, Santa CruzPractical Reproducibility by Managing Experiments Like SoftwareLukas MalyTechnical University of OstravaP03: BEM4I: A Massively Parallel Boundary Element SolverVani MandavaMicrosoftIntroduction - MTAGS17: 10th Workshop on Many-Task Computing on Clouds, Grids, and SupercomputersCloud Computing for Science and EngineeringKeynote: Cloud based systems and challenges for data rich research workloadsMTAGS17: 10th Workshop on Many-Task Computing on Clouds, Grids, and SupercomputersAndreas MangUniversity of TexasA Framework for Scalable Biophysics-Based Image AnalysisPavlos ManiotisAristotle University of ThessalonikiComputing Architectures Exploiting Optical Interconnect and Optical Memory TechnologiesFilippo MantovaniBarcelona Supercomputing CenterThe ARM User Experience: Testbeds and Deployment at HPC CentersP71: Is ARM Software Ecosystem Ready for HPC?Joseph ManzanoPacific Northwest National 
LaboratoryVerification of the Extended Roofline Model for Asynchronous Many Task RuntimesP99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big DataAniruddha MaratheLawrence Livermore National LaboratoryScrubJay: Deriving Knowledge from the Disarray of HPC Performance DataPerformance Modeling under Resource Constraints Using Deep Transfer LearningMartin MargalaUniversity of Massachusetts, LowellRE-HASE: Regular-Expressions Hardware Synthesis EngineGeorge S. MarkomanolisKing Abdullah University of Science and TechnologyGetting Started with the Burst Buffer: Using DataWarp TechnologyAndres MarquezPacific Northwest National LaboratoryEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Verification of the Extended Roofline Model for Asynchronous Many Task RuntimesExploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific KernelsP99: The Intersection of Big Data and HPC: Using Asynchronous Many Task Runtime Systems for HPC and Big DataThomas MarrinanUniversity of St. 
ThomasA Path from Serial Execution to Hybrid Parallelization for Learning HPCParallel Streaming for In Transit Analysis with Heterogeneous Data LayoutDavid MartinArgonne National LaboratoryHPC Impact Showcase: Defense SystemsSteven MartinCray IncEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Total Cost of Ownership and HPC System ProcurementPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlDavid MartinezSandia National LaboratoriesEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Jan MartinovicTechnical University of OstravaP62: How To Do Machine Learning on Big ClustersNaoya MaruyamaRIKENP41: OpenCL-Based High-Performance 3D Stencil Computation on FPGAsMichael MascagniFlorida State UniversityComputational Reproducibility at Exascale 2017 (CRE2017)Matt MastenIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationSergi Mateo BellidoBarcelona Supercomputing CenterMastering Tasking with OpenMPAmrita MathuriyaIntel CorporationEmbracing a New Era of Highly Efficient and Productive Quantum Monte Carlo SimulationsRyo MatsumiyaTokyo Institute of TechnologyP05: ooc_cuDNN : A Deep Learning Library Supporting CNNs over GPU Memory CapacityKazuya MatsumotoUniversity of AizuApplication of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore PlatformsSatoshi MatsuokaTokyo Institute of Technology2nd International Workshop on Post Moore's Era Supercomputing (PMES)Applying Temporal Blocking with a Directive-Based ApproachBlurring the Lines: High-End Computing and Data ScienceEnergy Efficiency Gains From Software: Retrospectives and PerspectivesP41: OpenCL-Based High-Performance 3D Stencil Computation on FPGAsP52: A Simulation-Based Analysis on the Configuration of Burst BufferDevin A. 
MatthewsUniversity of TexasP02: Strassen's Algorithm for Tensor ContractionGreg MatthewsNASA Ames Research CenterPBS Pro Open Source Project Community BoFMichael MattmillerChief Technology OfficerCity of SeattleHPC Connects Plenary: The Century of the CityMarta MattosoFederal University of Rio de JaneiroTracking of Online Parameter Fine-Tuning in Scientific WorkflowsTimothy MattsonIntel CorporationOpenMP Common Core: A “Hands-On” ExplorationProgramming Your GPU with OpenMP: A Hands-On IntroductionHPC Graph Toolkits and the GraphBLAS ForumZakhar A. MatveevIntel CorporationPerformance Tuning of Scientific Codes with the Roofline ModelAlexander MatzUniversity of HeidelbergP85: GPU Mekong: Simplified Multi-GPU Programming Using Automated PartitioningDimitri J. MavriplisUniversity of WyomingP28: High-Fidelity Blade-Resolved Wind Plant ModelingYury F. MaydanikInstitute of Thermal Physics Ural BranchThercon-LHPFuture of the Thermal Management – Thercon-LHP Water-Free Solutions for HPC CoolingRobert MaynardKitware IncIn Situ Summarization with VTK-mAkie MayumiJapan Atomic Energy AgencyApplication of a Communication-Avoiding Generalized Minimal Residual Method to a Gyrokinetic Five Dimensional Eulerian Code on ManyCore PlatformsPatrick McCormickLos Alamos National LaboratoryOpenMPIRControl Replication: Compiling Implicit Parallelism to Efficient SPMD with Logical RegionsKenton McHenryNational Center for Supercomputing Applications, University of IllinoisVirtualization Ecosystems – Supporting Increasingly Complex Scientific ApplicationsLois Curfman McInnesArgonne National LaboratorySoftware Engineering and Reuse in Computational Science and EngineeringSuzanne McIntoshNew York UniversitySecond Annual Meeting of the SIGHPC - Big Data ChapterSimon McIntosh-SmithUniversity of BristolA Survey of Application Memory Usage on a National Supercomputer: An Analysis of Memory Requirements on ARCHERThe ARM Software Ecosystem: Are We There Yet?Programming Your GPU with OpenMP: A Hands-On 
IntroductionThe ARM User Experience: Testbeds and Deployment at HPC CentersP69: Portable Methods for Measuring Cache Hierarchy PerformanceP96: Correcting Detectable Uncorrectable Errors in MemoryRobert McLayUniversity of TexasTexas Advanced Computing Center, University of TexasTracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRTMatt McLeanUniversity of MichiganThe ARM Software Ecosystem: Are We There Yet?Kim McMahonMcMahon ConsultingIntroduction - Women in HPC: Diversifying the HPC CommunityStephen McNallyOak Ridge National LaboratoryCharacterizing Faults, Errors, and Failures in Extreme-Scale SystemsDavid MeadowsStulz Air TechnologyChip-to-Atmosphere: Providing Safe and Effective Cooling for High-Density, High-Performance Data Center EnvironmentsRobert L. MeakinUS Department of Defense HPC Modernization ProgramAccelerating Defense Innovation of Military Aircraft with Computational Prototypes and High Performance ComputingMiriam MehlUniversity of StuttgartA Framework for Scalable Biophysics-Based Image AnalysisMaryam Mehri DehnaviRutgers UniversitySympiler: Transforming Sparse Matrix Codes by Decoupling Symbolic AnalysisSusan MehringerCornell UniversityFourth SC Workshop on Best Practices for HPC TrainingLars MejsnerGrundfosGrundfosBenefits of Having Sensors in Your Water Cooled HPCMario MelaraNational Energy Research Scientific Computing CenterManaging HPC Software Complexity with SpackMads MelchiorsGrundfosGrundfosBenefits of Having Sensors in Your Water Cooled HPCNathaniel MendozaTexas Advanced Computing Center, University of TexasSecuring HPC: Development of a Low Cost, Open Source, Multi-Factor Authentication InfrastructureHarshitha MenonLawrence Livermore National LaboratoryVerifying the Floating-Point Computation Equivalence of Manually and Automatically Differentiated CodeIntegrating OpenMP into the Charm++ Programming ModelP80: Adaptive Loop Scheduling with Charm++ to Improve Performance of Scientific ApplicationsMichal MertaTechnical 
University of OstravaP03: BEM4I: A Massively Parallel Boundary Element SolverBronson MesserOak Ridge National LaboratoryApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesP26: Optimizing Gravity and Nuclear Physics in FLASH for ExascalePaul MessinaArgonne National LaboratoryThe U.S. D.O.E. Exascale Computing Project – Goals and ChallengesPeter MessmerNvidia CorporationInteractivity in SupercomputingMartin MeuerISC EventsTOP500 SupercomputersTOP500 - Past, Present, FutureLauren MichaelUniversity of WisconsinSoftware Engineers: Careers in ResearchScott MichaelIndiana UniversityStudent Résumé WorkshopMarek MichalewiczUniversity of WarsawSupercomputing in the Shadow of Giants: Perspectives and Insights from Supercomputing Leaders Outside the “Big 5” Regions and OrganizationsMartial MichelData Machines CorpOpenStack For HPC: Best Practices for Optimizing Software-Defined InfrastructureGeorge MichelogiannakisLawrence Berkeley National LaboratoryPost Moore SupercomputingPARADISE: A ToolFlow to Model Emerging Technologies for the Post-CMOS Era in HPCSamuel P. MidkiffPurdue UniversityMassively Parallel 3D Image ReconstructionEthan L. MillerUniversity of California, Santa CruzCAPES: Unsupervised Storage Performance Tuning Using Neural Network-Based Deep Reinforcement LearningP65: CAPES: Unsupervised System Performance Tuning Using Neural Network-Based Deep Reinforcement LearningPhil MillerCharmworks IncCharm++ and AMPI: Adaptive and Asynchronous Parallel ProgrammingRoss MillerOak Ridge National LaboratoryGUIDE: A Scalable Information Directory Service to Collect, Federate, and Analyze Logs for Operational Insights into a Leadership HPC FacilityMichelle StroutUniversity of ArizonaSympiler: Transforming Sparse Matrix Codes by Decoupling Symbolic AnalysisJeff MilrodBittWare IncCooling Hot FPGAs: A Thermals First ApproachDaniel J. 
MilroyNational Center for Atmospheric ResearchQuality Assurance and Error Identification for the Community Earth System ModelMisun MinArgonne National LaboratoryWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Vladimir MironovLomonosov Moscow State UniversityAn Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi ProcessorP30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon PhiP37: PaSTRI: A Novel Data Compression Algorithm for Two-Electron Integrals in Quantum ChemistryAlok MishraStony Brook UniversityBenchmarking and Evaluating Unified Memory for OpenMP GPU OffloadingSatyajayant MisraNew Mexico State UniversityA Scalable Analytical Memory Model for CPU Performance PredictionJerome MitchellIndiana UniversityTeaching, Learning and Collaborating through Cloud Computing Online ClassesIoannis MitliagkasStanford UniversityDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataKatsunori MiuraKitami Institute of TechnologyP61: Cloud Resource Selection Based on PLS Method for Deploying Optimal Infrastructures for Genomic Analytics ApplicationMathew MixUniversity of MinnesotaP60: Managing dbGaP Data with Stratus, a Research Cloud for Protected DataAudris MockusUniversity of TennesseePosition Paper: Experiences on Clustering High-Dimensional Data Using pbdRMohamed MohamedIBMP58: Wharf: Sharing Docker Images across Hosts from a Distributed FilesystemAli MohammedUniversity of BaselP74: A Methodology for Bridging the Native and Simulated Executions of Parallel ApplicationsAyat MohammedTexas Advanced Computing Center, University of TexasPhysical Signatures of Cancer MetastasisKathryn MohrorLawrence Livermore National LaboratoryOptimizing MPISimon MollSaarland UniversityPACXXv2 + RV -- An LLVM-Based Portable High-Performance Programming ModelMd Atiqul MollahFlorida State UniversityModeling UGAL on the Dragonfly TopologyA 
Comparative Study of SDN and Adaptive Routing on Dragonfly NetworksModeling and Comparison of Large-Scale Interconnect DesignsShintaro MomoseNEC CorporationProject Aurora – Unveiling NEC’s Brand New Vector SupercomputerJose Monsalve DiazUniversity of DelawareOpenMP 4.5 Validation and Verification SuiteRaffaele MontellaParthenope University of NaplesProcessing of Crowd-Sourced Data from an Internet of Floating ThingsDavid MontoyaLos Alamos National LaboratoryHow To Analyze the Performance of Parallel Codes 101State of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)Shirley V. MooreOak Ridge National LaboratoryPorting a GAMESS Computational Chemistry Kernel to FPGAsKenneth MorelandSandia National LaboratoriesLarge Scale Visualization with ParaViewYoshiyuki MorieRIKENP48: Prototyping of Offloaded Persistent Broadcast on Tofu2 InterconnectVitali MorozovArgonne National LaboratoryRun-to-Run Variability on Xeon Phi Based Cray XC SystemsPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlKarla MorrisSandia National LaboratoriesIntroduction - PAW 2017: The 2nd Annual PGAS Applications WorkshopPAW 2017: The 2nd Annual PGAS Applications WorkshopWilliam MosesMassachusetts Institute of TechnologyOpenMPIRAlexander MoskovskyRSC TechnologiesAn Efficient MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation of Intel Xeon Phi ProcessorP30: MPI/OpenMP Parallelization of the Hartree-Fock Method for the Second Generation Intel Xeon PhiNicholas MossLos Alamos National LaboratoryP63: FleCSPH: a Parallel and Distributed Smoothed Particle Hydrodynamics Framework Based on FleCSIMisbah MubarakArgonne National LaboratoryIntroduction - Women in HPC: Diversifying the HPC CommunityCareer Panel Discussion: Hints and Tips to Progress Your CareerHow to Find the Help You Need – Identifying Mentors and Those Who Can Help You in Your CareerEarly Career Lightning TalksVirtual Poster Networking and 
MixerPredicting the Performance Impact of Different Fat-Tree ConfigurationsModeling and Simulation of Communication in HPC SystemsGihan MudaligeUniversity of WarwickComparison of Parallelization Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUsBeyond 16GB: Out-of-Core Stencil ComputationsP01: Cache-Blocking Tiling of Large Stencil Codes at RuntimeAnanya MuddukrishnaNorwegian University of Science and TechnologyToward Aggregated Grain GraphsFrank MuellerNorth Carolina State UniversityP89: Desh: Deep Learning for HPC System Health ResilienceMichel MuellerTokyo Institute of TechnologyHybrid Fortran: High Productivity GPU Porting Framework Applied to Japanese Weather Prediction ModelBenson MuiteUniversity of TartuA Comparison of Distributed Memory Fast Fourier Transform (FFT) Library PackagesYvo MulderDelft University of TechnologyAdopting OpenCAPI for High Bandwidth Database AcceleratorsJulia MullenMassachusetts Institute of TechnologyFourth SC Workshop on Best Practices for HPC TrainingFrom Outreach to Education to Researcher - Innovative Ways of Expanding the HPC CommunityMasaharu MunetomoHokkaido UniversityP61: Cloud Resource Selection Based on PLS Method for Deploying Optimal Infrastructures for Genomic Analytics ApplicationEdward A. MunsellUniversity of MinnesotaP60: Managing dbGaP Data with Stratus, a Research Cloud for Protected DataTodd MunsonArgonne National LaboratoryScalable In Situ Analysis of Molecular Dynamics SimulationsHitoshi MuraiRIKENRuntime Correctness Checking for Emerging Programming ParadigmsPreliminary Performance Evaluation of Coarray-based Implementation of Fiber Miniapp Suite Using XcalableMP PGAS LanguageKohei MurotaniRailway Technical Research InstituteP22: Numerical Simulation of Snow Accretion by Airflow Simulator and Particle SimulatorPhilip MurphyIntel CorporationOmni-Path User Group (OPUG) MeetingRichard MurphyMicron Technology Inc15th Graph500 ListMargaret E. 
MyersUniversity of TexasLowering Barriers into HPC through Open EducationMatthias S. MüllerRWTH Aachen UniversityRuntime Correctness Checking for Emerging Programming Paradigms
N
Jarek NabrzyskiUniversity of Notre DamePromoting Scientific WorkflowsJaroslaw NabrzyskiUniversity of Notre DameVirtualization Ecosystems – Supporting Increasingly Complex Scientific ApplicationsYasodhadevi NachimuthuPortland State UniversityExperiencing HPC for Undergraduates: Graduate Student PerspectiveRamkumar NagappanIntel CorporationEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)PowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlMasato NaitoSumitomo Rubber Industries LtdDevelopment of High Performance Rubber Materials for Tires Using K ComputerKoji NakadeRailway Technical Research InstituteP22: Numerical Simulation of Snow Accretion by Airflow Simulator and Particle SimulatorKengo NakajimaUniversity of TokyoKeynote - Application Development Framework for Manycore Architectures on Post-Peta/Exascale SystemsImplicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACCSoftware Engineering and Reuse in Computational Science and EngineeringP14: Robust SA-AMG Solver by Extraction of Near-Kernel VectorsACM Student Research Competition: Presentations by Semi-FinalistsPoster ReceptionACM Student Research CompetitionMasahiro NakaoRIKENPreliminary Performance Evaluation of Coarray-based Implementation of Fiber Miniapp Suite Using XcalableMP PGAS LanguageMotoki NakataNational Institute for Fusion ScienceP17: Fully Non-Blocking Communication-Computation Overlap Using Assistant Cores toward Exascale ComputingHai Ah NamLos Alamos National LaboratoryUsability, Scalability and Productivity on Many-Core Processors: Intel Xeon PhiKumudha NarasimhanIndian Institute of ScienceOptimizing Geometric Multigrid Method Computation Using a DSL ApproachBadri NarayananArgonne National 
LaboratoryVisualizing Silicene Growth Through Island Migration and CoalescenceRevathi NarayananMicron Technology IncThe Silver Lining of the Cloud is the EDGERupesh NasreIndian Institutes of Technology,Five-minute presentations by young researchers from around the world - part 2HPC Initiatives in IndiaThomas NaughtonOak Ridge National LaboratoryP59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing EnvironmentsMaxim NaumovNvidia CorporationParallel Jaccard and Related Graph Clustering TechniquesParallel Depth-First Search for Directed Acyclic GraphsPhilippe NavauxFederal University of Rio Grande do SulExperimental and Analytical Study of Xeon Phi ReliabilityAmericas HPC CollaborationMohammadamin NazirzadehUniversity of California, DavisP50: Energy-Efficient and Scalable Bio-Inspired Nanophotonic ComputingP49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect ArchitecturesAravind NeelakantanUniversity of FloridaMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignHenry NeemanUniversity of OklahomaFourth SC Workshop on Best Practices for HPC TrainingChris J. NewburnNvidia CorporationThe ARM Software Ecosystem: Are We There Yet?Esmond G. NgLawrence Berkeley National LaboratoryPlenary Invited TalkBao NguyenWashington State University, VancouverLarge-Scale Adaptive Mesh Simulations Through Non-Volatile Byte-Addressable MemoryP25: Large-Scale Adaptive Mesh Simulations Through Non-Volatile Byte-Addressable MemoryHoang NguyenUniversity of QueenslandFive-minute presentations by young researchers from around the world - part 1Eric J. 
NielsenNASA Langley Research CenterP04: Unstructured-Grid CFD Algorithms on Many-Core ArchitecturesDimitrios NikolopoulosQueen's University BelfastEnergy Efficient Supercomputing (E2SC)REFINE: Realistic Fault Injection via Compiler-Based Instrumentation for Accuracy, Portability and SpeedP11: Energy-Efficient Transprecision Techniques for Iterative RefinementTeodor NikolovMarie Skłodowska Curie Initial Training NetworksFive-minute presentations by young researchers from around the world - part 2Daisuke NishiuraJapan Agency for Marine-Earth Science and TechnologyP21: The First Real-Scale DEM Simulation of a Sandbox Experiment Using 2.4 Billion ParticlesBill NitzbergAltair EngineeringPBS Pro Open Source Project Community BoFAsare NkansahUniversity of KentuckyA Path from Serial Execution to Hybrid Parallelization for Learning HPCKelly NolanTalent StrategySelf Branding and Advocacy: How to Get Known in Your Organization and Push Your Ideas ForwardJean-Phillippe NomineFrench Alternative Energies and Atomic Energy CommissionFrench HPC Ecosystem and Strategy and the Role of CEAJean-Philippe NominéEuropean Technology Platform for High Performance ComputingFrench Alternative Energies and Atomic Energy CommissionEuropean Exascale Projects and Their Global ContributionsNaoya NomuraUniversity of TokyoP14: Robust SA-AMG Solver by Extraction of Near-Kernel VectorsMichael L. 
NormanSan Diego Supercomputer CenterUniversity of California, San DiegoFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseBoyana NorrisUniversity of OregonCompilation TechniquesAnastasiia NovikovaUniversity of HamburgP15: Toward Decoupling the Selection of Compression Algorithms from Quality ConstraintsLucy NowellUS Department of EnergyVPA Keynote: Visual Performance Analysis for Extremely Heterogeneous SystemsSmall Business and the Exascale Computing ProjectMasanori NunamiNational Institute for Fusion ScienceP17: Fully Non-Blocking Communication-Computation Overlap Using Assistant Cores toward Exascale Computing
O
Kevin O'BrienIBMImplementing Implicit OpenMP Data Sharing on GPUsMike O'ConnorNvidia CorporationToward Standardized Near-Data Processing with Unrestricted Data Placement for GPUsPatrick O'LearyKitware IncIntroduction - ISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationIn Situ Summarization with VTK-mISAV 2017: In Situ Infrastructures for Enabling Extreme-Scale Analysis and VisualizationBrian W. O'SheaMichigan State UniversityFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseSteve OberlinNvidia CorporationHow Serious Are We About the Convergence Between HPC and Big Data?Michael ObersteinerTechnical University MunichA Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma SimulationsSergey OblomovIntel CorporationWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Lena OdenArgonne National LaboratoryWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1Seda Ogrenci-MemikNorthwestern UniversityP46: Understanding How OpenCL Parameters Impact on Off-Chip Memory Performance of FPGA PlatformsMartin OhlerichLeibniz Supercomputing CentreP08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.IIDaniel OliveiraFederal University of Rio Grande do SulExperimental and Analytical Study of Xeon Phi ReliabilityStephen OliverSandia National LaboratoriesPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlStephen OlivierSandia National LaboratoriesOpenMPIRLuke OlsonUniversity of IllinoisP70: FFT, FMM, and Multigrid on the Road to Exascale: Performance Challenges and OpportunitiesHensley OmorodionUniversity of BeninSpecial Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)Theodore OmtzigtStillwater Supercomputing IncPosit ResearchPosit Math Unit (PMU) – A New Approach Toward Exascale ComputingKeiji OnishiRIKENP24: A Deployment of HPC Algorithm into Pre/Post-Processing for Industrial CFD on K-ComputerTakatsugu OnoKyushu UniversityP78: Performance Evaluation of Graph500 Considering CPU-DRAM Power ShiftingTomoya OnoUniversity of TsukubaEfficient and Scalable Calculation of Complex Band Structure Using Sakurai-Sugiura MethodGopalan OppiliappanIntel CorporationHigh Performance Computing Education in US Data ScienceSarp OralOak Ridge National LaboratoryGUIDE: A Scalable Information Directory Service to Collect, Federate, and Analyze Logs for Operational Insights into a Leadership HPC FacilityLustre Community BoF: Lustre Deployments for the Next 5 YearsJason OrenderOld Dominion UniversityP04: Unstructured-Grid CFD Algorithms on Many-Core ArchitecturesMarc S. 
OrrUniversity of WisconsinGravel: Fine-Grain GPU-Initiated Network MessagesSergio OrtegaUniversity of MalagaParallware Trainer: Interactive Tool for Experiential Learning of Parallel Programming Using OpenMP and OpenACCSamuel OshinIntel CorporationRun-to-Run Variability on Xeon Phi Based Cray XC SystemsMark OskinAdvanced Micro Devices IncUniversity of WashingtonGravel: Fine-Grain GPU-Initiated Network MessagesPaul OsmialowskiARM LtdHow The Flang Frontend Works - Introduction to the Interior of the Open-Source Fortran Frontend for LLVMMarcin OstaszEuropean Technology Platform for High Performance ComputingBarcelona Supercomputing CenterEuropean Exascale Projects and Their Global ContributionsMatthew OttenCornell UniversityThe OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactWhy Is MPI So Slow? Analyzing the Fundamental Limits in Implementing MPI-3.1Kaiming OuyangUniversity of California, RiversideCorrecting Soft Errors Online in Fast Fourier TransformKalin OvtcharovMicrosoftAccelerating Deep Neural Networks at Datacenter Scale with the BrainWave ArchitectureReturn to TopPHans PabstIntel CorporationP31: Understanding the Performance of Small Convolution Operations for CNN on Intel ArchitectureNikhil PadmanabhanYale UniversityCosmological Particle-Mesh Simulations in ChapelGlenn PageSustainaMetrixMultidisciplinary Education on Big Data + HPC + Atmospheric SciencesScott PakinLos Alamos National LaboratoryModeling UGAL on the Dragonfly TopologyBrian PanH3 Platform IncTowards a Composable Computer SystemDhabaleswar PandaOhio State UniversityESPM2'17: Opening RemarksESPM2'17: Closing RemarksAn In-Depth Performance Characterization of CPU- and GPU-Based DNN Training on Modern ArchitecturesScalable Reduction Collectives with Data Partitioning-Based Multi-Leader DesignInfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing, HEC Systems and UsageInfiniBand, Omni-Path, and High-Speed 
Ethernet for DummiesBig Data Meets HPC: Exploiting HPC Technologies for Accelerating Big Data Processing and ManagementBuilding Efficient Clouds for HPC, Big Data, and Deep Learning Middleware and ApplicationsAccelerating Big Data Processing and Machine/Deep Learning Middleware on Modern HPC ClustersESPM2 2017: Third International Workshop on Extreme Scale Programming Models and MiddlewareJean-Pierre PanzieraEuropean Technology Platform for High Performance ComputingAtosEuropean Exascale Projects and Their Global ContributionsThomas PapatheodoreOak Ridge National LaboratoryP26: Optimizing Gravity and Nuclear Physics in FLASH for ExascaleMichael E. PapkaArgonne National LaboratoryFlexible In Situ Visualization of LAMMPS SimulationsA Path from Serial Execution to Hybrid Parallelization for Learning HPCParallel Streaming for In Transit Analysis with Heterogeneous Data LayoutScalable In Situ Analysis of Molecular Dynamics SimulationsManish ParasharRutgers UniversityExtreme Scale Data Management for In-Situ Scientific WorkflowsSubmarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced CyberinfrastructureExperiencing HPC for Undergraduates: Introduction to HPC ResearchDevangi N. 
ParikhUniversity of TexasLowering Barriers into HPC through Open EducationChanyoung ParkUniversity of FloridaMulti-Fidelity Surrogate Modeling for Application/Architecture Co-DesignJunghyun ParkKorea Institute of Science and Technology InformationVisualization of Decision-Making Support (DMS) Information for Responding to a Typhoon-Induced DisasterKyongseok ParkKorea Institute of Science and Technology InformationP51: TuPiX-Flow: Workflow-Based Large-Scale Scientific Data Analysis SystemScott ParkerArgonne National LaboratoryRun-to-Run Variability on Xeon Phi Based Cray XC SystemsAlfredo Parra HinojosaTechnical University MunichA Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma SimulationsMark ParsonsUniversity of EdinburghHPC Impact Showcase: Energy and ClimateCarlo PascoeUniversity of FloridaA FPGA-Pipelined Approach for Accelerated Discrete-Event Simulation of HPC SystemsValerio PascucciUniversity of UtahFlexible In Situ Visualization of LAMMPS SimulationsIgor PasichnykIBMP08: Performance Optimization of Matrix-free Finite-Element Algorithms within deal.IIJohn PatchettLos Alamos National LaboratoryLarge Scale Visualization with ParaViewTirthak PatelNortheastern UniversityFailures in Large Scale Systems: Long-Term Measurement, Analysis, and ImplicationsOnkar PatilNorth Carolina State UniversityA28: Exploring Use Cases for Non-Volatile Memories in Support of HPC ResilienceTapasya PatkiLawrence Livermore National LaboratoryP94: Fully Hierarchical Scheduling: Paving the Way to Exascale WorkloadsAbani K. PatraUniversity at BuffaloA Slurm Simulator: Implementation and Parametric AnalysisKarthik PattabiramanUniversity of British ColumbiaUnderstanding Error Propagation in Deep Learning Neural Network (DNN) Accelerators and ApplicationsMichael PattersonIntel CorporationEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Robert M. 
PattersonNational Center for Supercomputing Applications, University of IllinoisMilky Way Analogue Isolated Disk GalaxyFirst Light in the Renaissance Simulation Visualization: Formation of the Very First Galaxies in the UniverseRobert M. PattonOak Ridge National LaboratoryIntroduction - Machine Learning in HPC EnvironmentsMd Mostofa PatwaryIntel CorporationDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesSri Raj paulRice UniversityChapel-on-X: Exploring Tasking Runtimes for PGAS LanguagesRobert S. PavelLos Alamos National LaboratoryP20: Facilitating the Scalability of ParSplice for Exascale TestbedsGrzegorz PawelczakUniversity of BristolP96: Correcting Detectable Uncorrectable Errors in MemoryDavid PearahHDF GroupThe HDF5 DataverseRoger PearceLawrence Livermore National LaboratoryToward Scalable Parallel Training of Deep Neural NetworksCarl PearsonUniversity of IllinoisP16: Scaling Analysis of a Hierarchical Parallelization of Large Inverse Multiple-Scattering SolutionsKevin PedrettiSandia National LaboratoriesState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)P90: Global Survey of Energy and Power-Aware Job Scheduling and Resource Management in Supercomputing CentersJim PeekMissing Link ElectronicsHeterogeneous Multi-Processing in Software-Defined Cloud Storage NodesPaul Peltz JrLos Alamos National LaboratoryHPC Systems Professionals WorkshopZong PengIndiana UniversityReliable Access to Massive Restricted Texts: Experience-Based EvaluationSwann PerarneauArgonne National LaboratoryExperiencing HPC for Undergraduates: Careers in HPCGuilherme Peretti-PezziSwiss National Supercomputing CentreRegression Testing and Monitoring ToolsOlga PerevalovaUniversity of HamburgP57: Adaptive Tier Selection for NetCDF and HDF5Danny PerezLos Alamos National LaboratoryGaining Insights into the Properties of Materials Using 
Atomistic Simulations on Large-Scale HPC PlatformsP20: Facilitating the Scalability of ParSplice for Exascale TestbedsDavid Perez-SuarezUniversity College LondonSoftware Engineers: Careers in ResearchChris PerssonUniversity West SwedenP44: Increasing Throughput of Multiprogram HPC Workloads: Evaluating a SMT Co-Scheduling ApproachBradley PetersonUniversity of UtahScientific Computing and Imaging InstituteAddressing Global Data Dependencies in Heterogeneous Asynchronous Runtime Systems on GPUsSerge PetitonUniversity of LilleMaison de la SimulationParallel Jaccard and Related Graph Clustering TechniquesRuntime Correctness Checking for Emerging Programming ParadigmsFabrizio PetriniIntel CorporationHPC Graph Toolkits and the GraphBLAS ForumAntonio J. PeñaBarcelona Supercomputing Center, Polytechnic University of CataloniaGPUs and CommunicationDavid PfanderUniversity of StuttgartP77: AutoTuneTMP: Auto Tuning in C++ With Runtime Template MetaprogrammingDirk PflügerUniversity of StuttgartA Highly Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma SimulationsP77: AutoTuneTMP: Auto Tuning in C++ With Runtime Template MetaprogrammingFranz-Josef PfreundtFraunhofer Institute for Industrial MathematicsTensorQuant - A Simulation Toolbox for Deep Neural Network QuantizationVinanti PhadkeHewlett Packard EnterprisePowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlAnusha PhadnisDhirubhai Ambani Institute of Information and Communication TechnologyP27: Parallelization of the Particle-In-Cell Monte Carlo Collision (PIC-MCC) Algorithm for Plasma Simulation on Intel MIC Xeon Phi ArchitectureThe Anh PhamFrench Institute for Research in Computer Science and Automation (INRIA)ENS RennesVerifying MPI Applications with SimGridMCCynthia A. 
PhillipsSandia National LaboratoriesIntroduction - Workshop on Education for High Performance Computing (EduHPC)Everett PhillipsNvidia CorporationA Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU SystemsLaercio PillaFederal University of Santa CatarinaExperimental and Analytical Study of Xeon Phi ReliabilitySergio Pino GallardoUniversity of DelawareOpenMP 4.5 Validation and Verification SuiteBeth PlaleIndiana UniversityReliable Access to Massive Restricted Texts: Experience-Based EvaluationDirk PleiterForschungszentrum JuelichThe OLCF GPU Hackathon Series: The Story Behind Advancing Scientific Applications with a Sustained ImpactThoughts on the Path Toward Exascale from a JSC PerspectiveBest Practices for Architecting Performance and Capacity in the Burst Buffer EraApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesSteve PlimptonSandia National LaboratoriesFernbach Award Presentation: Particles, HPC, and the Ukulele SyndromeNorbert PodhorszkiOak Ridge National LaboratoryData Analysis of Earth System Simulation within an In Situ InfrastructureArtur PodobasTokyo Institute of TechnologyP41: OpenCL-Based High-Performance 3D Stencil Computation on FPGAsJames PoggeTennessee Technological UniversityP59: Secure Enclaves: An Isolation-Centric Approach for Creating Secure High-Performance Computing EnvironmentsMartin PokornyNational Radio Astronomy Observatoryrealfast@VLAJorda PoloBarcelona Supercomputing CenterTopology-Aware GPU Scheduling for Learning Workloads in Cloud EnvironmentsArtem PolyakovMellanox TechnologiesCharting the PMIx RoadmapDuncan PooleNvidia CorporationOpenACC API User Experience, Vendor Reaction, Relevance, and RoadmapSteve PooleLos Alamos National LaboratoryOpenSHMEM in the Era of ExascaleExperiencing HPC for Undergraduates: Careers in HPCSwaroop PophaleOak Ridge National LaboratoryOpenMP 4.5 Validation and Verification SuiteVasileios PorpodasIntel CorporationLLVM Compiler Implementation for Explicit 
Parallelization and SIMD VectorizationAllan PorterfieldUniversity of North CarolinaQUARC: An Optimized DSL Framework Using LLVMDouglass E. PostUS Department of Defense HPC Modernization ProgramAccelerating Innovation of Defense Systems with Computational Prototypes and High Performance ComputingAlex PothenPurdue UniversityHPC Graph Toolkits and the GraphBLAS ForumCourtney PowellHokkaido UniversityP61: Cloud Resource Selection Based on PLS Method for Deploying Optimal Infrastructures for Genomic Analytics ApplicationMichael M. PozulpLawrence Livermore National LaboratoryP79: Porting the Opacity Client Library to a CPU-GPU Cluster Using OpenMP 4.5Mr PrabhatLawrence Berkeley National LaboratoryDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataGalactos: Computing the 3-pt Anisotropic Correlation for 2 Billion GalaxiesSushil K. PrasadGeorgia State UniversityIntroduction - Workshop on Education for High Performance Computing (EduHPC)Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsCommon Big Data Challenges in Bio, Geo, Climate, and Social SciencesJames PriceUniversity of BristolP69: Portable Methods for Measuring Cache Hierarchy PerformanceReid PriedhorskyLos Alamos National LaboratoryCharliecloud: Unprivileged Containers for User-Defined Software Stacks in HPCContainers in HPCHoward PritchardLos Alamos National LaboratoryGraph500 on OpenSHMEM: Using a Practical Survey of Past Work to Motivate Novel Algorithmic DevelopmentsW. 
Cyrus ProctorTexas Advanced Computing Center, University of TexasSecuring HPC: Development of a Low Cost, Open Source, Multi-Factor Authentication InfrastructureRoberto ProiettiUniversity of California, DavisSilicon Photonic LIONS: All-to-All Interconnects for Energy-Efficient, Scalable, and Modular HPC SystemsP50: Energy-Efficient and Scalable Bio-Inspired Nanophotonic ComputingP49: Toward Exascale HPC Systems: Exploiting Advances in High Bandwidth Memory (HBM2) through Scalable All-to-All Optical Interconnect ArchitecturesAndrea ProsperettiUniversity of HoustonVistas in Advanced ComputingJoachim ProtzeRWTH Aachen UniversityRuntime Correctness Checking for Emerging Programming ParadigmsSpencer R. PruittWorcester Polytechnic InstituteP32: Exploring the Performance of Electron Correlation Method Implementations on Kove XPDsDavid PugmireOak Ridge National LaboratoryScalable HPC Visualization and Data Analysis Using VisItShweta PurawatSan Diego Supercomputer CenterA Machine Learning Approach for Modular Workflow Performance PredictionSatish PuriMarquette UniversityP19: MPI-GIS: An MPI System for Big Spatial DataMilos PuzovicHartree CentreState of the Practice: Energy and Power Aware Job Scheduling and Resource Management (EPA-JSRM)Philippe P. 
PébaÿSandia National LaboratoriesA Novel Shard-Based Approach for Asynchronous Many-Task Models for In Situ AnalysisLean Visualization of Large Scale Tree-Based AMR MeshesReturn to TopQDepei QianBeihang UniversitySun Yat-Sen UniversityChina’s New HPC Key ProjectYingjin QianDataDirect NetworksA Configurable Rule-Based Classful Token Bucket Filter Network Request Scheduler for the Lustre File SystemYang QiaoDelft University of TechnologyAdopting OpenCAPI for High Bandwidth Database AcceleratorsZhi QiaoUniversity of North TexasP55: Incorporating Proactive Data Rescue into ZFS Disk Recovery for Enhanced Storage ReliabilityJudy QiuIndiana UniversityTeaching, Learning and Collaborating through Cloud Computing Online ClassesHarp-DAAL: A Next Generation Platform for High Performance Machine Learning on HPC-CloudIrene QualtersNational Science FoundationNational Strategic Computing Initiative UpdateHeather QuinnLos Alamos National LaboratoryExperimental and Analytical Study of Xeon Phi ReliabilityMartin QuinsonENS RennesVerifying MPI Applications with SimGridMCEnrique S. Quintana-OrtiJaume I UniversityFlexible Batched Sparse Matrix-Vector Product on GPUsReturn to TopRCarolyn RaabCorsa TechnologyProtecting against Hyper Scale Network Attacks with Bump-in-the-Wire 100G filteringEvan RacahLawrence Berkeley National LaboratoryDeep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific DataKen RaffenettiArgonne National LaboratoryWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1MPICH: A High-Performance Open-Source MPI ImplementationBruno RAFFINFrench Institute for Research in Computer Science and Automation (INRIA)Melissa: Large Scale In Transit Global Sensitivity Analysis Avoiding Intermediate FilesPadma RaghavanVanderbilt UniversityInvited Talks 2Md Shafayat RahmanFlorida State UniversityModeling UGAL on the Dragonfly TopologyIoan RaicuIllinois Institute of Technology, Argonne National LaboratorySoftware for HPC FacilitiesSwapna RajIntel CorporationLESS: Loop Nest Execution Strategies for Spatial ArchitecturesSivasankaran RajamanickamSandia National LaboratoriesDesigning Vector-Friendly Compact BLAS and LAPACK KernelsBatched, Reproducible, and Reduced Precision BLASEspen Birger RaknesAker BP ASATowards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core ArchitecturesVinay B. RamakrishnaiahUniversity of WyomingP20: Facilitating the Scalability of ParSplice for Exascale TestbedsKarthik RamanIntel CorporationP13: Large-Scale GW Calculations on Pre-Exascale HPC SystemsJ. RamanujamLouisiana State UniversityWOLFHPC: Workshop on Domain-Specific Languages and High-Level Frameworks for High-Performance ComputingHPX Smart ExecutorsMouad RamilNational School of Bridges and Roads - ParisTechP20: Facilitating the Scalability of ParSplice for Exascale TestbedsTim RandlesLos Alamos National LaboratoryCharliecloud: Unprivileged Containers for User-Defined Software Stacks in HPCOpenStack For HPC: Best Practices for Optimizing Software-Defined InfrastructureContainers in HPCArvind RaoSan Diego Supercomputer CenterA Machine Learning Approach for Modular Workflow Performance PredictionGil RapaportIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationGeorg RathLawrence Berkeley National LaboratoryTracking and Analyzing Job-level Activity Using Open XDMoD, XALT and OGRTThilina RathnayakeUniversity of IllinoisWhy Is MPI So Slow? 
Analyzing the Fundamental Limits in Implementing MPI-3.1Tyler S. RauLos Alamos National LaboratoryP54: Investigating Hardware Offloading for Reed-Solomon EncodingArchana RavindarIBMApplication Porting and Optimization on GPU-Accelerated POWER ArchitecturesNavamita RayLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesPablo RebleIntel CorporationExpressing Heterogeneous Parallelism in C++ with Intel Threading Building BlocksPaolo RechFederal University of Rio Grande do SulAnalyzing the Criticality of Transient Faults-Induced SDCs on GPU ApplicationsExperimental and Analytical Study of Xeon Phi ReliabilityDaniel ReedUniversity of IowaCommon Big Data Challenges in Bio, Geo, Climate, and Social SciencesEnergy Efficiency Gains From Software: Retrospectives and PerspectivesIstvan Zoltan RegulyPazmany Peter Catholic UniversityComparison of Parallelization Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUsBeyond 16GB: Out-of-Core Stencil ComputationsP01: Cache-Blocking Tiling of Large Stencil Codes at RuntimeJames ReindersJames Reinders Consulting LLCExpressing Heterogeneous Parallelism in C++ with Intel Threading Building BlocksSteven K. 
ReinhardtMicrosoftGravel: Fine-Grain GPU-Initiated Network MessagesGPU Triggered Networking for Intra-Kernel CommunicationsNico ReissmannNorwegian University of Science and TechnologyToward Aggregated Grain GraphsTowards Fine-Grained Dynamic Tuning of HPC Applications on Modern Multi-Core ArchitecturesSeverin ReizTechnical University MunichGeometry-Oblivious FMM for Compressing Dense SPD MatricesLuc RenambotUniversity of Illinois, ChicagoSAGE2 9th Annual International SC BOF: Scalable Amplified Group Environment for Global CollaborationArnaud RenardUniversity of Reims Champagne-ArdenneP64: romeoLAB : HPC Training Platform on HPC facilityVasudevan RengasamyPennsylvania State UniversityOptimizing Word2Vec Performance on Multicore SystemsSebastian RettenbergerTechnical University MunichExtreme Scale Multi-Physics Simulations of the Tsunamigenic 2004 Sumatra Megathrust EarthquakeAlejandro RibesElectricity of France (EDF)Keynote: Computing Ubiquitous Statistics: Computational ChallengesMelissa: Large Scale In Transit Global Sensitivity Analysis Avoiding Intermediate FilesMichael RiceIntel CorporationLLVM Compiler Implementation for Explicit Parallelization and SIMD VectorizationMorris RiedelResearch Center JuelichSupporting Software Engineering Practices in the Development of Data-Intensive HPC Applications with the JuML FrameworkLorna RiveraGeorgia Institute of TechnologyWhy Subtle Bias is Often Worse than Blatant DiscriminationPanel Discussion: Diversifying the HPC workforceIntroduction - Women in HPC: Diversifying the HPC CommunityFrom Outreach to Education to Researcher - Innovative Ways of Expanding the HPC CommunityRecruitment: How to Build Diverse TeamsSilvio RizziArgonne National LaboratoryFlexible In Situ Visualization of LAMMPS SimulationsParallel Streaming for In Transit Analysis with Heterogeneous Data LayoutIn Situ Analysis and Visualization with SENSEIYves RobertFrench Institute for Research in Computer Science and Automation (INRIA)Resilient N-Body Tree 
Computations with Algorithm-Based Focused Recovery: Model and Performance AnalysisBudget-Aware Scheduling Algorithms for Scientific Workflows on IaaS Cloud PlatformsFault-Tolerance for High Performance and Distributed Computing: Theory and PracticeJames RobnettNational Radio Astronomy Observatoryrealfast@VLAMichael RobsonUniversity of IllinoisMigratable Objects and Task-Based Parallel Programming with Charm++Ivan RoderoRutgers UniversitySubmarine: A Subscription-Based Data Streaming Framework for Integrating Large Facilities and Advanced CyberinfrastructureJames RogersOak Ridge National LaboratoryEighth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)Total Cost of Ownership and HPC System ProcurementMike L. RogersLos Alamos National LaboratoryP07: PORTAGE - A Flexible Conservative Remapping Framework for Modern HPC ArchitecturesGeorgios RokosIBMImplementing Implicit OpenMP Data Sharing on GPUsJoshua RomeroNvidia CorporationA Performance Study of Quantum ESPRESSO's PWscf Code on Multi-Core and GPU SystemsTodd RosedahlIBMPowerAPI, GEOPM and Redfish: Open Interfaces for Power/Energy Measurement and ControlArnold L. 
RosenbergNortheastern UniversityIntroduction - Workshop on Education for High Performance Computing (EduHPC)Revisions to NSF/IEEE-TCPP Curriculum on Parallel and Distributed Computing (PDC) for Undergraduate Education - Updates on the Curriculum Revision and Audience CommentsRob RossArgonne National LaboratoryParallel I/O in PracticeBarry RountreeLawrence Livermore National LaboratoryPerformance Modeling under Resource Constraints Using Deep Transfer LearningPower-Aware High Performance Computing: Challenges and Opportunities for Application and System DevelopersDamian RousonSourcery InstitutePGAS Applications Workshop PanelPerformance Portability of an Intermediate-Complexity Atmospheric Research Model in Coarray FortranCindy Rubio GonzálezUniversity of California, DavisIntroduction - 1st International Workshop on Software Correctness for HPC Applications (Correctness 2017)Cindy Rubio-GonzalezUniversity of California, DavisCorrectness 2017: First International Workshop on Software Correctness for HPC ApplicationsAndy RudoffIntel CorporationInvited Talk: Persistent Memory: The Value to HPC and the Challeng