Grace Hopper Architecture Performance Analysis
Comprehensive benchmarking of large language model inference on NVIDIA DGX Spark, comparing execution environments and investigating memory behavior on unified Grace Hopper architecture.
Systematic comparison of Docker containerized execution versus native (chroot) execution across three large language models.
Extended testing with cgroup workarounds, alternative containerization methods, and deeper investigation into Grace Hopper unified memory behavior.