Paul Okafor

Paul Okafor

Data Scientist & AI Engineer

Computer Vision Enthusiast
AI/ML Expert
LLM/GenAI and AI Agent Engineer

Professional Summary

Paul is a highly skilled AI/LLM Engineer and Data Scientist with over five years of experience designing and deploying intelligent, end-to-end AI systems. Currently pursuing a Ph.D. in Data Science and Analytics at the University of Oklahoma, Paul’s work bridges applied AI engineering and healthcare research, focusing on building interpretable, real-time AI systems for aging and disease modeling.

Paul’s technical strengths lie in LLM fine-tuning, multi-agent system design, and AI infrastructure development. He has built production-grade, multi-agent LLM platforms using LangGraph, LangChain, and Azure AI, including a flagship conversational AI agent for Telogical Systems, which became the company’s primary interface for telecom-data analytics. His independent portfolio extends to deep-research AI agents (“Morgana”), real-time voice AI systems integrating LiveKit and FastAPI, and computer vision models leveraging YOLO for object detection.

In academia, Paul’s research applies machine learning to understand the biological and cognitive impact of chemotherapy-induced aging in pediatric cancer survivors. His work explores predictive modeling, biological age estimation, and microvascular damage detection, contributing to emerging solutions for neurocognitive decline and age-related diseases. Beyond oncology, his research extends to Cerebral Small Vessel Diseases (CSVD) and federated models for biological age prediction, underscoring his commitment to AI for health and longevity.

Skills & Expertise

AI & Machine Learning

Neural Networks
Reinforcement Learning
Deep Learning
Generative AI
Causal Inference
LLM Fine-Tuning

Cloud Architecture & MLOps

AWS SageMaker
Docker/Kubernetes
CloudFormation
Vector DBs
GPU Acceleration
CI/CD
Docker
Git

Programming

Python
Java
TypeScript
TensorFlow
PyTorch
LangChain
LangGraph
FastAPI
Flask
Next.js
Hugging Face Transformers
Scikit-Learn
CUDA
Pandas
NumPy

Data Science

Feature Engineering
Dimensionality Reduction
Clustering
Statistical Modeling
Databricks
Plotly Dash

Work Experience

Graduate Research Assistant

University of Oklahoma

Norman, OK, USA

August 2023 - Present

  • Developed cost-sensitive online learning models for control chart pattern recognition, improving real-time monitoring and anomaly detection in manufacturing processes, and significantly reducing misclassification costs.
  • Applied interpretable machine learning (IML) techniques, including SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations), to optimize recombinant protein titer production in *E. coli* fermentations, contributing to advancements in biomanufacturing.
  • Co-authored and presented research on advanced filtering techniques (Kalman Filter, Particle Filter) for titer estimation at the 2024 IEEE Big Data Conference, demonstrating improved accuracy in recombinant protein production using fermentation data.
Python
PyTorch
TensorFlow
Machine Learning
Deep Learning
Cost-Sensitive Learning
Online Learning
Control Chart Pattern Recognition
Anomaly Detection
Interpretable Machine Learning (IML)
SHAP
LIME
Time Series Analysis
Data Analysis
Filtering Techniques
Kalman Filter
Particle Filter
Biomanufacturing
Statistical Modeling
R

Data Scientist

Backyard Innovations Limited

Nigeria

September 2020 - July 2023

  • Led the development and deployment of predictive analytics models for smart home energy systems, using advanced machine learning techniques (e.g., time series forecasting with LSTM networks) to predict hourly energy consumption. This resulted in a 20% improvement in system efficiency and reduced energy waste.
  • Performed comprehensive analysis of weather data (temperature, humidity, solar radiation) and energy consumption data from smart homes to identify optimization opportunities, leading to significant operational cost savings and improved energy usage patterns.
  • Developed and maintained data pipelines to ingest, process, and store large volumes of time-series data from smart home devices, ensuring data quality and availability for analysis and modeling.
  • Created interactive dashboards and reports using Power BI to visualize key performance indicators (KPIs) and communicate insights to stakeholders, facilitating data-driven decision-making.
Python
Machine Learning
Predictive Analytics
Time Series Forecasting
LSTM Networks
Data Modeling
Data Analysis
Power BI
Data Pipelines
ETL
SQL
Data Visualization

Technical Sales Engineer

Fortizo Energy Resources Limited

Nigeria

February 2020 - July 2020

  • Developed and delivered technical bids and proposals, resulting in a significant increase in the company's contract win rate and improved client satisfaction.
  • Utilized expertise in process designs, equipment lists, and heat/material balances to ensure the accuracy and efficiency of project deliverables. Built and maintained strong client relationships to drive sales targets and revenue growth.
  • Conducted market research and competitive analysis to identify new business opportunities and inform sales strategies.
Technical Sales
Proposal Writing
Process Design
Client Relationship Management
Communication
Market Research
Competitive Analysis
Presentation Skills

Education

Ph.D. in Data Science and Analytics

University of Oklahoma

January 2025 - Present

GPA: 3.5

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • OU Data Science Club

M.S. in Data Science and Analytics

University of Oklahoma

August 2023 - December 2024

GPA: 3.5

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • Graduate Research Assistant
  • National Science Foundation (NSF) Agric AI coding challenge Winner
  • TeamElectra in the Air Selangor Data & Digital Hackathon 2024 in Malaysia (Finalist)

M.S. in Petroleum Engineering and Project Development

University of Port Harcourt

November 2018 - November 2019

GPA: 4.6

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • TeamElectra in the Air Selangor Data & Digital Hackathon 2024 in Malaysia (Finalist)

B.S. in Petroleum and Gas Engineering

University of Lagos

October 2012 - December 2016

GPA: 3.54

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • TeamElectra in the Air Selangor Data & Digital Hackathon 2024 in Malaysia (Finalist)

Hobbies & Interests

📺 Binge-Watching TV Shows

Watching popular series and discovering new favorites

🏔️ Hiking

Exploring Oklahoma's beautiful trails and mountain ranges

Camping

Connecting with nature and embracing outdoor adventures

🏀 Basketball

Playing pick-up games and following the NBA

✈️ Traveling

Exploring new cultures and broadening perspectives

🍳 Cooking

Trying new recipes and experimenting with different cuisines