overview for 2dollarsim

Linked original reddit post, but this didn't work for me. I had to take a bunch of extra steps so I've written a tutorial. Original instructions here which I'll refer to, so you don't have to visit reddit. My revised tutorial with all instructions will follow this in the replies, please post questions as a new post in this community, I've locked this thread so that the tutorial remains easily accessible.

Zyin 24 points 2 months ago*

Instructions on how to get this setup if you've never used Jupyter before, like me. I'm not an expert at this, so don't respond asking for technical help.

If you've never done stuff that needs Python before, you'll need to install Pip and Git. Google for the download links. If you have Automatic1111 installed already you already have Pip and Git.

Install the repo. It will be installed in the folder where you open the cmd window:

git clone https://github.com/serp-ai/bark-with-voice-clone

Open a new cmd window in newly downloaded repo's folder (or cd into it) and run it's installation stuff:

pip install .

Install Jupyter notebook. It's basically Google Collab, but ran locally:

pip install jupyterlab (this one may not be needed, I did it anyway)

pip install notebook

If you are on windows, you'll need these to do audio code stuff with Python:

pip install soundfile

pip install ipywidgets

You need to have Torch 2 installed. You can do that with this command (will take a while to download/install):

pip3 install numpy --pre torch torchvision torchaudio --force-reinstall --index-url https://download.pytorch.org/whl/nightly/cu118

To check your current Torch version, open a new cmd window and type these in one at a time:

python import torch print(torch.__version__) #(mine says 2.1.0.dev20230421+cu118)

Now everything is installed. Create a folder called "output" in the bark folder, which will be needed later to prevent a permissions error.

Run Jupyter Notebook while in the bark folder:

jupyter notebook

This will open a new browser tab wit the Jupyter interface. Navigate to /notebooks/generate.ipynb

This is very similar to Google Collab where you run blocks of code. Click on the first block of code and click Run. If the code block has a "[*]" next to it, then it is still processing, just give it a minute to finish.

This will take a while and download a bunch of stuff.

If it manages to finish without errors, run blocks 2 and 3. In block 3, change the line to: filepath = "output/audio.wav" to prevent a permissions related error (remove the leading "/").

You can get different voices by changing the voice_name variable in block 1. Voices are installed at: bark\assets\prompts

For reference on my 3060 12GB, it took 90 seconds to generate 13 seconds of audio. The voice models that come out of the box create a robotic sounding voice, not even close to the quality of ElevenLabs. The voice that I created using /notebooks/clone_voice.ipynb with my own voice turned out terrible and was completely unusable, maybe I did something wrong with that, not sure.

If you want to test the voice clone using your own voice, and you record a voice sample using windows Voice Recorder, you can convert the .m4a file to .wav with ffmpeg (separate download):

ffmpeg -i "C:\Users\USER\Documents\Sound recordings\Recording.m4a" "C:\path\to\bark-with-voice-clone\ ___

Funny title in c/lemmyshitpost@lemmy.world

[–] 2dollarsim@lemmy.world 4 points 2 years ago

Some of these were absolute gold.. I'm happy to see these coming back!

Which interface are you primarily using for Stable diffusion? in c/stable_diffusion@lemmy.dbzer0.com

[–] 2dollarsim@lemmy.world 1 points 2 years ago (1 children)

I would agree, but the rate of innovation in AI is so unpredictable that it could go either way.

Which interface are you primarily using for Stable diffusion? in c/stable_diffusion@lemmy.dbzer0.com

[–] 2dollarsim@lemmy.world 1 points 2 years ago (4 children)

Haha it won't be a joke next week when the new text-to-video model comes out

recommended gaming controller in c/linux_gaming@lemmy.world

[–] 2dollarsim@lemmy.world 4 points 2 years ago

PS4 controller forever. It 'just works' and I'm used to it after the decades. Can't stand the PS5 controller. It feels too big like the first xbox controller.

Back on meds after a couple months. Did not miss this. in c/adhd@lemmy.world

[–] 2dollarsim@lemmy.world 6 points 2 years ago (1 children)

Strange.. my adhd leads me to forget to eat. I get so absorbed in today's obsession that I wonder why I'm so fatigued at 1pm after not eating anything after a tiny breakfast.

My Strained Journey Through Diagnosis in The Worst Country in The World in c/adhd@lemmy.world

[–] 2dollarsim@lemmy.world 2 points 2 years ago

Fuck dude. That's heartbreaking. I'm almost 37 and I'm only just now motivating myself to get treatment. Don't give up hope, keep persevering!

Current status of Intel Arc on Linux in c/linux_gaming@lemmy.world

[–] 2dollarsim@lemmy.world 1 points 2 years ago

Laaaaaame. At least you managed to get a refund!

I take it you're aware of protondb and tried the suggestions there? There's a few people reporting recently that it works fine for them. Maybe recent updates have fixed issues

Which interface are you primarily using for Stable diffusion? in c/stable_diffusion@lemmy.dbzer0.com

[–] 2dollarsim@lemmy.world 1 points 2 years ago (6 children)

Not for videos, we are still quite a way from that yet.

Which interface are you primarily using for Stable diffusion? in c/stable_diffusion@lemmy.dbzer0.com

[–] 2dollarsim@lemmy.world 6 points 2 years ago

Automatic1111 is the second one I tried and I never left. It's the best.

Outrage in Lebanon after girl, 6, dies due to sexual assault in c/world@lemmy.world

[–] 2dollarsim@lemmy.world 1 points 2 years ago

WTAF.

1

The shadow (lemmy.world)

submitted 2 years ago by 2dollarsim@lemmy.world to c/dreaming@lemmy.world

0 comments fedilink

I enter my son's room because I heard talking, meaning he isn't going to sleep.

When I go in, I find his cousin there, who won't stop laughing. I grab his arm and ask him why he is there, and I realise there is no way he could be there. I angrily yank his arm and demand he tells me how he is here, why he is here, but he just keeps laughing.

I hear ragged breathing from my son, like he is struggling to breathe or being choked, and I turn back to see his head and shoulders covered in a dark shadow. I let go of his cousin and rush towards my son, and the shadow leaps onto my face making the world completely dark.

I wake up breathing quickly.

1

Reddit:pixelnull: My tips for people new to Pygmalion to get better responses and hopefully this clears up a few things about Pygmalion generally. (www.reddit.com)

submitted 2 years ago by 2dollarsim@lemmy.world to c/oobabooga@lemmy.world

0 comments fedilink

really good tips here!

1

AI news this week (www.youtube.com)

submitted 2 years ago* (last edited 2 years ago) by 2dollarsim@lemmy.world to c/oobabooga@lemmy.world

0 comments fedilink

1

Aitrepreneur: NEW ExLLAMA Breakthrough! 8K TOKENS! LESS VRAM & SPEED BOOST! (www.youtube.com)

submitted 2 years ago by 2dollarsim@lemmy.world to c/oobabooga@lemmy.world

0 comments fedilink

Tested it myself, huge improvement!

1

First post (lemmy.world)

submitted 2 years ago by 2dollarsim@lemmy.world to c/oobabooga@lemmy.world

0 comments fedilink

I'll try post as much good content here as possible to get it started

2

Intense fantasy-style dream I had last night (not nsfw) (lemmy.world)

submitted 2 years ago by 2dollarsim@lemmy.world to c/dreaming@lemmy.world

0 comments fedilink

Part 1: I was with friends discussing a book that we all enjoyed. Then I was in a house, and I realised that the book was connected to the house. A deep tunnel was being dug, to find something. We were going to find the tunnel. I walked into the other room to light the fire. I picked up the piece of coal to put in the fireplace. It was shaped like the head of an animal. It appeared to be possessed by some malevolent spirit. People were worried about the spirit, but I was antagonistic, and went hunting to find it. Then I can remember entering a large concrete building. A big hairy creature was standing at an opening, trying to fix some clockwork mechanism with chains. He complained that I had made things difficult for him, the chains weren't aligning. He then finally got a large weight attached to a chain to an opening, and let it go, and it started dragging the chains as it slid out.

Part 2: I went outside, and saw the chains had started to pull a surface of lead out. I was on top of a long concrete slope, but now the surface was covered in lead. It was like fast flowing liquid metal, except it was solid. I could walk on it, but every second it was changing, appearing to be a river of metal. I enjoyed running down the slope, and up the sides of the walls as the metal flowed over them. I reached the bottom, and was with 2 people: my brother and our close friend. We followed a river until we came to a dam made of terraced stone blocks. It was quite well made, and obviously for people to enjoy walking beside. But we had noticed that the river had been flowing towards the dam, and from the pool at the bottom, small trickles were running up the hill to the top of the dam. My friend commented that 'if you found water that behaved that way, wouldn't you build something to try to stop it?' And it seemed this new fancy stone dam was built on top of an ancient structure.

Part 3: We continued walking and at the top of the dam we were in a forest. I split off from the group and noticed in the trees, there were hanging bunches of raspberries. I ate one, and it was delicious. I yelled to the others to try them too, and then noticed that all sorts of different delicious looking berries were hanging down, some even looked and tasted like gummy worms. And then I found the book on the ground, that I mentioned earlier. Things seemed very bad. I ran back over to my friend, who was then acting crazy, and was trying to eat a plastic bag. He tried to brush me off when I tried to get him to stop. I grabbed him by his head and I could see his eyes didn't look right. I told him "I love you bro, I wouldn't be telling you to do this if it wasn't deadly serious. You have to snap out of it. trust me!" And he seemed to understand, and took the plastic out of his mouth.

Part 4: Then we walked out of the forest and found ourselves in a room, like a small cafe with windows where we could see the street. Me and my brother were there, and our friend was seated at a table, with a collection of odd things arranged in a specific way. Things like, stones, leaves, half a walnut shell. And there was a woman, she had black hair. She said "and now the last part. A close friend makes 2 brothers realise they have betrayed each other." and the friend moved two objects on the table into different positions. "it's done." he said. And then I realised, that this has all happened before. We had been manipulated again into completing their ritual.

Final part:

I started to cry and panic a bit, and I yelled at the two of them: "We have been trying to escape but they keep bringing us back here!" The woman told my brother and my friend, "It's not a good idea to look at the sky." But I knew what was there, and I told the other 2 they should look and see for themselves. Outside the world had changed, there was different technology, strange vehicles, and people dressed very differently. Covering most of the sky above us, there was a huge eye, surrounded by dark brown fur. It was watching the world, while everyone tried to avoid it's gaze. And as I looked out the window, moving just enough to glimpse the edge of the eye, I knew I didn't want it to see me.