Ph.D. Dissertation

Optimization of Communication Intensive Applications on HPC Networks. Illinois Research and Technical Reports - Computer Science (CS Res. & Tech. Report) 2016.

Software

  • TraceR: a configurable HPC runtime environment and application simulator.
  • CODES: a packet-level optimistic PDES-based network simulator.
  • Damselfly: a tool for modeling steady state of dragonfly networks.
  • BGQ-NCL: an easy interface for network counter collection on BG/Q systems.
  • Charm++: an adaptive parallel programming paradigm and runtime system.
  • OpenAtom: a scalable Ab-Initio Molecular Dynamics software.
  • EpiSimdemics: a hybrid contagion modeling software.
  • NAMD: a highly scalable parallel molecular dynamics code.
  • Charm ROSS: Charm++ based implementation of the parallel discrete event simulator ROSS.

Peer-reviewed Journal/Conference Publications

  1. Samuel D. Pollard, Nikhil Jain, Stephen Herbein, and Abhinav Bhatele. Evaluation of an interference-free node allocation policy on fat-tree clusters. SC 2018.
  2. Staci A. Smith, Clara Cromey, David K. Lowenthal, Jens Domke, Nikhil Jain, Jayaraman J. Thiagarajan, and Abhinav Bhatele. Mitigating inter-job interference using adaptive flow-aware routing. SC 2018.
  3. Harsh Bhatia, Nikhil Jain, Abhinav Bhatele, Yarden Livnat, Jens Domke, Valerio Pascucci, and Peer-Timo Bremer. Interactive investigation of traffic congestion on fat-tree networks using TreeScope. Euro-Vis 2018, published in Computer Graphics Forum.
  4. Kevin A. Brown, Nikhil Jain, Satoshi Matsuoka, Martin Schulz, and Abhinav Bhatele. Interference between I/O and MPI traffic on fat-tree networks. ICPP 2018.
  5. Jayaraman J. Thiagarajan, Nikhil Jain, Rushil Anirudh, Alfredo Giménez, Rahul Sridhar, Aniruddha Marathe, Tao Wang, Murali Emani, Abhinav Bhatele, and Todd Gamblin. Bootstrapping parameter space exploration for fast tuning. ICS 2018.
  6. Jayaraman Thiagarajan, Rushil Anirudh, Bhavya Kailkhura, Nikhil Jain, Tanzima Islam, Abhinav Bhatele, Jae-Seung Yeom, and Todd Gamblin. PADDLE: Performance Analysis using a Data-driven Learning Environment. IPDPS 2018.
  7. Nikhil Jain, Abhinav Bhatele, Louis Howell, David Bohme, Ian Karlin, Edgar Leon, Misbah Mubarak, Noah Wolfe, Todd Gamblin, and Matthew Leininger. Predicting the Performance Impact of Different Fat-tree Configurations. SC 2017.
  8. Aniruddha Marathe, Rushil Anirudh, Nikhil Jain, Abhinav Bhatele, Jayaraman Thiagarajan, Bhavya Kailkhura, Jae-Seung Yeom, Barry Rountree, and Todd Gamblin. Performance Modeling Under Resource constraints using Deep Transfer Learning. SC 2017.
  9. Misbah Mubarak, Nikhil Jain, Jens Domke, Noah Wolfe, Caitlin Ross, Jianping Li, Abhinav Bhatele, Christopher D. Carothers, Kwan-Liu Ma, and Robert B. Ross. Toward Reliable Validation of HPC Interconnect Simulations. Winter Simulation Conference 2017.
  10. Xiang Ni, Nikhil Jain, Kavitha Chandrasekar, and Laxmikant Kale. Runtime Techniques for Programming with Fast and Slow Memory. Cluster 2017.
  11. Misbah Mubarak, Philip Carns, Jonathan Jenkins, Jianping Li, Nikhil Jain, Shane Snyder, Robert B. Ross, Christopher D. Carothers, Abhinav Bhatele, and Kwan-Liu Ma. Quantifying I/O and Communication Traffic Interference on Dragonfly Networks Qquipped with Burst Buffers. Cluster 2017.
  12. Juan J. Galvez, Nikhil Jain, and Laxmikant V. Kale. Automatic Topology Mapping of Diverse Large-scale Parallel Applications. ICS 2017.
  13. Nikhil Jain, Abhinav Bhatele, Xiang Ni, Todd Gamblin, and Laxmikant V. Kale. Partitioning low-diameter networks to eliminate inter-job interference. IPDPS 2017.
  14. Abhinav Bhatele, Jae-Seung Yeom, Nikhil Jain, Chris J. Kuhlman, Yarden Livnat, Keith R. Bisset, Laxmikant V. Kale, and Madhav V. Marathe. Massively parallel simulations of spread of infectious diseases over realistic social networks. CCGrid 2017 Scale Challenge.
  15. Noah Wolfe, Misbah Mubarak, Nikhil Jain, Jens Domke, Abhinav Bhatele, Christopher D. Carothers, and Robert B. Ross. Preliminary Performance Analysis of Multi-rail Fat-tree Networks. CCGrid 2017.
  16. Nikhil Jain, Abhinav Bhatele, Sam White, Todd Gamblin, and Laxmikant Kale. Evaluating HPC Networks via Simulation of Parallel Workloads. SC 2016.
  17. Nikhil Jain, Eric Bohm, Eric Mikida, Subhasish Mandal, Minjung Kim, Prateek Jindal, Qi Li, Sohrab Ismail-Beigi, Glenn J. Martyna, and Laxmikant V. Kale. OpenAtom: Scalable Ab-Initio Molecular Dynamics with Diverse Capability. ISC HPC 2016.
  18. Abhinav Bhatele, Nikhil Jain, Yarden Livnat, Valerio Pascucci, and Peer-Timo Bremer. Analyzing network health and congestion in dragonfly-based systems. IPDPS 2016.
  19. Eric Mikida, Nikhil Jain, Elsa Gonsiorowski, Peter D. Barnes, Jr., David Jefferson, Christopher Carothers, and Laxmikant Kale. Towards PDES in a Message-Driven Paradigm: A Preliminary Case Study Using Charm++. SIGSIM PADS 2016.
  20. Nikhil Jain, Abhinav Bhatele, Jae-Seung Yeom, Mark F. Adams, Francesco Miniati, Chao Mei, Laxmikant Kale. Charm++ & MPI: Combining the Best of Both Worlds. IPDPS 2015.
  21. Abhinav Bhatele, Andrew Titus, Jayaraman Thiagarajan, Nikhil Jain, Todd Gamblin, Peer-Timo Bremer, Martin Schulz, Laxmikant Kale. Identifying the Culprits behind Network Congestion. IPDPS 2015.
  22. Ehsan Totoni, Nikhil Jain, Laxmikant Kale. Power Management of Extreme-scale Networks with On/Off Links in Runtime Systems. TOPC 2014, Journal.
  23. Abhinav Bhatele, Nikhil Jain, Katherine E. Isaacs, Ronak Buch, Todd Gamblin, Steven H. Langer, Laxmikant V. Kale. Improving Application Performance via Task Mapping on IBM Blue Gene/Q. HiPC 2014.
  24. Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant Kale. Maximizing Network Throughput on the Dragonfly Interconnect. SC 2014.
  25. Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, Harshitha Menon, Eric Mikida, Xiang Ni, Michael Robson, Yanhua Sun, Ehsan Totoni, Lukasz Wesolowski, Laxmikant Kale. Parallel Programming with Migratable Objects: Charm++ in Practice. SC 2014.
  26. James Phillips, Yanhua Sun, Nikhil Jain, Eric J. Bohm, Laxmikant Kale. Mapping to Irregular Torus Topologies and Other Techniques for Petascale Biomolecular Simulation. SC 2014.
  27. Nikhil Jain, Abhinav Bhatele, Michael Robson, Laxmikant Kale. Predicting application performance using supervised learning on communication features. SC 2013.
  28. Xiang Ni, Esteban Meneses, Nikhil Jain, Laxmikant Kale. ACR: Automatic Checkpoint/Restart for Soft and Hard Error Protection. SC 2013.
  29. Nikhil Jain, JohnMark Lau, Laxmikant Kale. Collectives on Two-tier Direct Networks. EuroMPI 2012.
  30. Harshitha Menon, Nikhil Jain, Gengbin Zheng, Laxmikant Kale. Automated Load Balancing Invocation based on Application Characteristics. Cluster 2012.
  31. Anshul Mittal, Nikhil Jain, Thomas George, Yogish Sabharwal, Sameer Kumar. Collective Algorithms for Sub-communicators. ICS 2012.
  32. Abhinav Bhatele, Nikhil Jain, William Gropp and Laxmikant Kale. Avoiding hot-spots on two-level direct networks. SC 2011.
  33. Ehsan Totoni, Abhinav Bhatele, Eric Bohm, Nikhil Jain, Celso Mendes, Ryan Mokos, Gengbin Zheng and Laxmikant Kale. Simulation-based Performance Analysis and Tuning for a Two-level Directly Connected System. ICPADS 2011.
  34. Anshul Mittal, Jagobondhu Hazra, Nikhil Jain, Vivek Goyal, Deva Seetharam and Yogish Sabharwal. Real Time Contingency Analysis for Power Grids. Euro-Par 2011.
  35. Nikhil Jain and Yogish Sabharwal. Optimal Bucket Algorithms for large MPI Collectives on Torus Interconnect. ICS 2010.
  36. Venkatesan Chakaravarthy, Nikhil Jain and Yogish Sabharwal. Optimizing Matrix Transpose on Torus Interconnects. Euro-Par 2010.

Peer-reviewed Workshop Publications

  1. Bilge Acun, Nikhil Jain, Abhinav Bhatele, Misbah Mubarak, Christopher Carothers, Laxmikant Kale. Preliminary Evaluation of a Parallel Trace Replay Tool for HPC Network Simulations. Workshop on Parallel and Distributed Agent-Based Simulations at EURO-PAR 2015.
  2. Ehsan Totoni, Nikhil Jain, Laxmikant Kale. Toward Runtime Power Management of Exascale Networks by On/Off Control of Links. Workshop on Higher Peformance Power Aware Computing at IPDPS 2013.
  3. Laxmikant Kale, Nikhil Jain, Akhil Langer, Esteban Meneses, Phil Miller, Osman Sarood, Ehsan Totoni. A Multi-resolution Emulation + Simulation Methodology. Position paper at Workshop on Modeling & Simulation of Exascale Systems & Applications 2013.
  4. Laxmikant Kale, Osman Sarood, Eric Bohm, Nikhil Jain, Akhil Langer, Esteban Meneses. Action- able Performance Modeling for Future Supercomputers. Position paper at Workshop on Modeling & Simulation of Exascale Systems & Applications 2013.
  5. Nikhil Jain, Brajesh Pande and Phalguni Gupta. SMP Based Solver for Large Binary Systems. The Tenth International Workshop on Parallel and Distributed Algorithms and Applications 2009.

Posters

  • Model-Agnostic Influence Analysis for Performance Data. SC 2017.
  • Improving Performance of Networks & Applications using Simulations. Computation, LLNL Postdoc Poster Session 2017. (2nd Place).
  • Simulating and visualizing traffic on the dragonfly network. SC 2015.
  • Interoperating MPI and Charm++ for Productivity and Performance. SC 2014.
  • Fast Prediction of Network Performance: k-packet Simulation. SC 2013. (Silver Award).
  • Understanding Network Contention on Blue Gene Supercomputers. LLNL Student Poster Session 2012. (Best Poster Award).
  • Collective Algorithms for Sub-communicators. PPoPP 2012.