Tamira M. Moon
Michael Bagel
Drew Matter
Drew Matter leads Mikros Technologies, a designer and manufacturer of best-in-class direct liquid cold plates for AI/HPC, semiconductor testing, laser & optics, and power electronics. Mikros provides leading microchannel thermal solutions in single-phase, 2-phase, DLC and immersion systems to leading companies around the world.
Steve Mills
Steve Mills is a Mechanical Engineer who has dedicated over 25 years to the development of IT hardware in the enterprise and hyperscale space. After tours at DELL and Storspeed, he joined Meta in 2012 and is currently a Technical Lead for Data Center and Hardware Interfaces. He also serves on the Open Compute Project Steering Committee representing the Cooling Environments Project. He has 48 US patents and is an author of eight papers covering the packaging and cooling of electronics.
Matt Archibald
Matt Archibald is the Director of Technical Architecture at nVent supporting the data center and networking space. Matt is deeply focused on liquid cooling (close-coupled and direct-to-chip), unified infrastructure management, data center monitoring, and automated data center infrastructure management.
Vinod Kamath
Mikros Technologies
Website: https://www.mikrostechnologies.com/
Mikros Technologies provides industry leading liquid cooling to a variety of HPC and Data Center markets and is considered the best-in-class solution by next-gen chip designers. Their high-effectiveness heat transfer empowers designers to improve the performance, packaging and reliability of a wide range of complex systems. At Mikros Technologies, their liquid cooling options offer low-pressure drops and low flow rates, so you can enjoy superior performance while consuming less energy. Mikros Technologies liquid cooling solutions can help your data center meet operating goals and creates more power bandwidth for chip designers, AI algorithms and more.
For an organization to make effective use of an AI cluster, it is important to consider the entire process of designing, building, deploying and managing the resource. At each step, a cluster for AI presents new and different challenges that even experienced IT team members may not have encountered before. In this presentation, Penguin Solutions CTO Philip Pokorny will explore AI clusters from design to daily management and will speak to:
- Key considerations when designing an AI cluster
- Important areas that can compromise AI cluster performance
- Ways that software solutions like Penguin's unique Scyld ClusterWare can address complexities
- How to ensure maximum value from your AI cluster investment
Phil Pokorny
Phil Pokorny is the Chief Technology Officer (CTO) for SGH / Penguin Solutions. He brings a wealth of engineering experience and customer insight to the design, development, support, and vision for our technology solutions.
Phil joined Penguin in February of 2001 as an engineer, and steadily progressed through the organization, taking on more responsibility and influencing the direction of key technology and design decisions. Prior to joining Penguin, he spent 14 years in various engineering and system administration roles with Cummins, Inc. and Cummins Electronics. At Cummins, Phil participated in the development of internal network standards, deployed and managed a multisite network of multiprotocol routers, and supported a diverse mix of office and engineering workers with a variety of server and desktop operating systems.
He has contributed code to Open Source projects, including the Linux kernel, lm_sensors, and LCDproc.
Phil graduated from Rose-Hulman Institute of Technology with Bachelor of Science degrees in math and electrical engineering, with a second major in computer science.
Penguin Solutions
Website: https://www.penguinsolutions.com/
Penguin Solutions designs, builds, deploys, and manages AI and accelerated computing infrastructures at scale. With 25+ years of HPC experience – and more than 75,000 GPUs deployed and managed to date – Penguin is a trusted strategic partner for AI and HPC solutions and services for leading organizations around the world.
Designing, deploying, and operating “AI factories” is an incredibly complex endeavor and Penguin has successfully been delivering AI factories at scale since 2017. The company’s OriginAI infrastructure, which is backed by Penguin's specialized intelligent cluster management software and expert services, streamlines AI implementation and management, and enables predictable AI cluster performance that supports customers’ business needs and return on investment goals for clusters small or large, ranging in size from hundreds to thousands of GPUs.
The OriginAI solution builds on Penguin’s extensive AI infrastructure expertise to reduce complexity and accelerate return on investment, providing CEOs and CIOs alike the essential and reliable infrastructure they need to deploy and manage demanding AI workloads at scale in the data center and at the edge. To learn more visit their website at: https://www.penguinsolutions.com. Follow Penguin Solutions on LinkedIn, Twitter, YouTube, and Facebook.
Sanchit Juneja
Sanchit Juneja has 18+ years of Tech Leadership Experience in tech and product roles across The US, South-east Asia, Africa, South-east Asia, and Europe with organizations such as Booking.com, AppsFlyer, GoJek, Rocket Internet, and National Instruments. Currently Director- Product (Big Data & ML/AI) with Booking.com