Okay, so yesterday I dove into something kinda new for me: playing around with Jessica Delp’s stuff. You know, the AI voice cloning thing? I’d been hearing about it, seeing it pop up everywhere, and figured, hey, why not give it a shot?

First off, I started by trying to find a good dataset. Found a bunch of her interviews and podcasts online. I spent a solid hour just downloading audio, making sure it was all clean and without too much background noise. That’s the boring part, but you gotta do it right, otherwise the whole thing is garbage in, garbage out, right?
Next up, the fun part, but also the kinda confusing part: the actual AI stuff. I fired up Google Colab – free GPUs, can’t beat that! I followed a tutorial I found on some random blog (can’t remember which one exactly), which walked me through setting up the environment and installing all the necessary libraries. Lots of copy-pasting code, I ain’t gonna lie. And lots of error messages I had to Google. Stack Overflow is my best friend, seriously.
Then, it was time to actually feed the audio data into the AI. This part took forever! The model started training, and I just sat there, watching the progress bar slowly inch forward. I think it took about 4 hours, maybe more, before it finally finished. I almost fell asleep a couple of times.
Once the training was done, I tried generating some speech. I typed in a few random sentences, things I thought Jessica Delp might say. Clicked the “generate” button, and… it kinda worked? The voice was definitely recognizable, but it sounded a bit robotic and unnatural. It needed tweaking.
I messed around with the settings a bit, trying different parameters and configurations. I found that lowering the “temperature” made the voice sound more stable, but also a bit more monotone. Raising it made it sound more expressive, but also introduced more glitches and artifacts. It was a delicate balance.

After a few hours of tweaking, I finally got something that sounded pretty decent. It wasn’t perfect, but it was good enough to fool some people, I think. I even showed it to my wife, and she was kinda freaked out by how realistic it sounded.
Lessons Learned:
- Data quality is key. Garbage in, garbage out, remember?
- Be patient. AI training takes time.
- Don’t be afraid to experiment. There’s no magic formula.
- Stack Overflow is your friend.
Overall, it was a fun and interesting project. I learned a lot about AI voice cloning, and I got to play around with some cool technology. Would I do it again? Absolutely. But next time, I’ll make sure to have a stronger coffee and maybe a nap beforehand.
Final Thoughts
This whole AI voice cloning thing is pretty wild. It’s got a lot of potential, but it also raises some serious ethical questions. I mean, what’s stopping someone from using this technology to create fake news or impersonate someone without their permission? It’s something we need to think about.