Daniel Nichols

Orcid: 0000-0002-3538-6164

According to our database, Daniel Nichols authored at least 29 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Performance-Aligned LLMs for Generating Fast HPC Code.
IEEE Trans. Parallel Distributed Syst., June, 2026

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning.
CoRR, April, 2026

Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary Search.
CoRR, April, 2026

Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training.
CoRR, April, 2026

2025
Optimizing Agentic Language Model Inference via Speculative Tool Calls.
CoRR, December, 2025

LLMs as Packagers of HPC Software.
CoRR, November, 2025

Integrating Performance Tools in Model Reasoning for GPU Kernel Optimization.
CoRR, October, 2025

Modeling Code: Is Text All You Need?
CoRR, July, 2025


HPC-Coder-v2: Studying Code LLMs Across Low-Resource Parallel Languages.
Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

The Shortcomings of Code LLMs in Modeling Code Properties.
Proceedings of the IEEE/ACM International Workshop on Large Language Models for Code, 2025

ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks.
Proceedings of the 54th International Conference on Parallel Processing, 2025

2024
Performance-Aligned LLMs for Generating Fast Code.
CoRR, 2024

Automated Programmatic Performance Analysis of Parallel Programs.
CoRR, 2024

HPC-Coder: Modeling Parallel Programs using Large Language Models.
Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

A Probabilistic Approach To Selecting Build Configurations in Package Managers.
Proceedings of the International Conference for High Performance Computing, 2024

Learning to Predict and Improve Build Successes in Package Ecosystems.
Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories, 2024

Predicting Cross-Architecture Performance of Parallel Programs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Can Large Language Models Write Parallel Code?
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024

Relative Performance Prediction Using Few-Shot Learning.
Proceedings of the 48th IEEE Annual Computers, Software, and Applications Conference, 2024

2023
Modeling Parallel Programs using Large Language Models.
CoRR, 2023

Porting a Computational Fluid Dynamics Code with AMR to Large-scale GPU Platforms.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Resource Utilization Aware Job Scheduling to Mitigate Performance Variability.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
How to Train Your Neural Network: A Comparative Evaluation.
CoRR, 2021

2020
Integrating Deep Learning in Domain Sciences at Exascale.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020

2019
MagmaDNN: Accelerated Deep Learning Using MAGMA.
Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning), 2019

openDIEL: A Parallel Workflow Engine and Data Analytics Framework.
Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning), 2019

MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing.
Proceedings of the High Performance Computing, 2019

2018
Analogues of the 3x + 1 Problem in Polynomial Rings of Characteristic 2.
Exp. Math., 2018

