Connect Build

Project

Enabling AI inferencing on z/OS

Objective

To port, optimize, and deploy the open-source llama.cpp inference engine on IBM z/OS, enabling efficient AI model execution within enterprise workloads on the mainframe.

Outcome

This effort aims to bring modern AI inference capabilities to z/OS, enabling seamless integration of machine learning models into enterprise workloads.

Apply By Date	02 Jun 2025
Students	1 / 4
Duration	60 days
Mentor	Haritha D

Tools-Technologies

Platform

1. Mainframe

Link to be provided once students start working on a project.

College

1. PES University, Bangalore

2. PESIT

Haritha D's Comment

Environment Setup – 3 days
- Set up z/OS build environment for C++ development
- Identify and install required dependencies
Adapt llama.cpp for zopen – 2 days
- Analyze llama.cpp source code for system dependencies
- Adapt llama.cpp for the zopen framework
Build & Compilation
- Resolve bootstrap issues – 7 days
- Resolve configure issues – 7 days
- Resolve build errors and validate successful compilation – 20 days
Enable unit test cases
- Enable unit tests and integrate test results into the zopen framework – 7 days
Validation
- Run sample models and validate output correctness – 3 days
- Benchmark performance – compare with Linux on IBM Z (zLinux) and document results – 2 days
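
The benchmark comparison above could be recorded along the following lines. This is a minimal sketch, not part of the project plan: the `BenchRun` class, its field names, and the `relative_throughput` helper are hypothetical, and the numbers are placeholders rather than measured results. Python is used only for brevity; the port itself is C++.

```python
from dataclasses import dataclass


@dataclass
class BenchRun:
    """One timed generation run (hypothetical structure for illustration)."""
    platform: str
    tokens_generated: int
    elapsed_seconds: float

    def tokens_per_second(self) -> float:
        # Throughput is tokens produced divided by wall-clock time.
        if self.elapsed_seconds <= 0:
            raise ValueError("elapsed_seconds must be positive")
        return self.tokens_generated / self.elapsed_seconds


def relative_throughput(candidate: BenchRun, baseline: BenchRun) -> float:
    """Return the candidate's throughput as a fraction of the baseline's."""
    return candidate.tokens_per_second() / baseline.tokens_per_second()


# Placeholder numbers for illustration only, not measured results.
zos = BenchRun("z/OS", tokens_generated=100, elapsed_seconds=10.0)
zlinux = BenchRun("zLinux", tokens_generated=100, elapsed_seconds=8.0)

print(f"{zos.platform}: {zos.tokens_per_second():.1f} tokens/s")
print(f"{zlinux.platform}: {zlinux.tokens_per_second():.1f} tokens/s")
print(f"z/OS relative to zLinux: {relative_throughput(zos, zlinux):.2f}")
```

For a fair comparison, both runs would presumably use the same model file, prompt, and generation length on each platform.
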
Documentation
- Provide setup, build, and usage instructions for z/OS – 1 day
- Document any platform-specific changes made to the code – 3 days
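
One kind of platform-specific change likely to need documenting is byte order and character encoding: z/OS runs on big-endian hardware and uses EBCDIC by default, while model files in the GGUF format are typically little-endian with an ASCII magic value. The sketch below illustrates the idea of decoding a header with an explicit byte order and explicit byte values rather than relying on host defaults. The header layout shown (magic, version, tensor count, metadata count) is an assumption based on the commonly described GGUF header, and `parse_gguf_header` is a hypothetical helper, not a llama.cpp function. Python is used only for brevity.

```python
import struct

# Spell the magic out as byte values so it does not depend on the
# platform's character encoding (these are the ASCII codes for "GGUF").
GGUF_MAGIC = bytes([0x47, 0x47, 0x55, 0x46])

# "<" forces little-endian decoding regardless of the host byte order.
# Assumed layout: 4-byte magic, uint32 version, uint64 tensor count,
# uint64 metadata key/value count.
HEADER_FORMAT = "<4sIQQ"
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)


def parse_gguf_header(data: bytes) -> dict:
    """Decode the assumed fixed-size header from the start of a buffer."""
    if len(data) < HEADER_SIZE:
        raise ValueError("buffer is shorter than the header")
    magic, version, tensor_count, kv_count = struct.unpack(
        HEADER_FORMAT, data[:HEADER_SIZE]
    )
    if magic != GGUF_MAGIC:
        raise ValueError("unexpected magic value")
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }


# Synthetic header built in memory for illustration; not a real model file.
sample = struct.pack(HEADER_FORMAT, GGUF_MAGIC, 3, 2, 5)
print(parse_gguf_header(sample))
```

In C++ the same principle would apply: read multi-byte fields through a byte-swapping helper on big-endian hosts, and compare magic values against numeric byte constants rather than character literals, which may be compiled as EBCDIC.
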