Work

Data generation System

Devtools
Software Testing

A synthetic data generation tool for testing software applications and also used for data intensive systems

Iridescent ripples of a bright blue and pink liquid

Stacks/Tools Used

Python, Streamlit

How I got involved

Tell me and I forget. Teach me and I remember. Involve me and I learn.

I was working on a consent management system with a team, before launching, there was need for load testing and beta-testing the product, then i decide to create tool to generate synthetic data to serve this purpose. Also, a friend of mine needed to train some ML model , but could not find data to use, then i showed him this tool which was a valuable toolkit for his model.

Why was the project complex

The project was averagely complex as data requirement varies by application, the complexity give rise to creating different option, but the first version used some pre-defined data models

What i did to drive success

I was not deterred by the complexity and technical limitation, instead i focused on breaking the complexity of the required data models to a generic models.

Driving factor for success

The driving factor for the project was when i got an actual user of the tool, which was a master student at IVY league , he needed some synthetic data for a machine learning model to detect a software intrusion detection.

Conflict that arose

Some conflict arose as the project implementation and feature was going out of scope

Lessons learnt

  • Don’t focus the discussion too much on the problem, focus more on the solution

How I have grown

I am now learning the inclusion of no-code system in the applications.

Project Assets