If english language only has 44 sounds. How long until we can fakeaudio people...

Question

If english language only has 44 sounds. How long until we can fakeaudio people...

Robert Cruz

If english language only has 44 sounds. How long until we can fakeaudio people? Does this software already exist sort of like fakeapp that can pool through spoken audio, pull the sounds then use them to create new full audio?
>Example: Instead of in Rogue one where they fakeapped young leia into the movie they could really have her sound like a royal british princess?

Source:
dvusd.org/cms/lib/AZ01901092/Centricity/Domain/3795/Sound_Spelling_Chart.pdf

February 18, 2018 - 09:11

Other urls found in this thread:

deepmind.com/blog/wavenet-generative-model-raw-audio/
lyrebird.ai/demo/
youtube.com/watch?v=I3l4XLZ59iw
twitter.com/NSFWRedditImage

Aiden Long

they probably already can, it just isn't consumer technology yet

February 18, 2018 - 09:12

Michael Nelson

> it just isn't consumer technology yet
cant we just write it in python? That would work I think. I think I figured out how it could be done but noone has done it yet.

February 18, 2018 - 09:14

Chase Powell

well go do it and make money. don't go on any planes because they kill you that way

February 18, 2018 - 09:22

Aiden Cruz

>don't go on any planes because they kill you that way
well the fakeapp guy is alive, whats the diff between that and this? This is just audio, it should be way easier since all someone has to do is write it in python.

February 18, 2018 - 09:29

Landon Lewis

Google did something like this last year. deepmind.com/blog/wavenet-generative-model-raw-audio/

February 18, 2018 - 09:30

William Barnes

It already exists.

lyrebird.ai/demo/

February 18, 2018 - 09:30

Benjamin Butler

Do you niggers even Google? The stupidity of this thread hurts my brain... This is the same as going to a woodworking forum, asking if it's possible to create an electric powered saw [it will be called an "electric saw."]

February 18, 2018 - 09:32

Christopher Bennett

hmmm well that means its doable.

All we need is python to rip audio like it rips video and then generate that audio into statements.

It would be awesome imagine hearing Luke Skywalker and Princess leia doing a cover of justin beiber and mariah carey song All I want for Christmas is You.

February 18, 2018 - 09:38

Isaiah Bailey

>Google
get out

February 18, 2018 - 10:17

Camden Scott

that does exist already.
somebody once posted videos and screenshots and a slideshow iirc of this a few weeks ago.

can't recall the name though and it's still super new.
some other Cred Forumsfag is also working on an AI that makes women nude by removing their bikinis using an algorithm

exciting times in IT, in that regard

February 18, 2018 - 10:21

Luke Howard

What language has the most sounds, does any language cover all sounds?
I have an affinity for Japanese girls speaking English by the way.

February 18, 2018 - 10:25

Josiah Collins

>get out
great movie

February 18, 2018 - 10:29

Daniel Bailey

>planes
Rich and powerful people use planes the most.

February 18, 2018 - 10:45

Dylan Sullivan

this is a really stupid question

February 18, 2018 - 10:55

Austin Walker

I may be ignorant on the topic but at least I'm not rude.

February 18, 2018 - 10:58

James Rodriguez

be more fun to have audio than video, could use it everywhere then from youtube like to make us sound like other people like Jack Black!

February 18, 2018 - 11:01

Brandon Robinson

2016.
Adobe Voco.

February 18, 2018 - 11:01

Thomas Adams

that would be an accent, yes if we write a python script to rip sounds to generate sentences it should do it with the accent intact.

Easy peasy!

February 18, 2018 - 11:02

Jack Cook

don't reply to me ever again lil bitch

February 18, 2018 - 11:05

Landon Russell

I know there are some sounds used by languages that aren't used in other languages which makes it almost impossible for foreigners, but most of those are dead or African so it depends on if you mean all sounds in all languages or all sounds in all relevant languages.

February 18, 2018 - 11:06

Connor Rogers

>you will live long enough to have an AI assistant generate speech with a broken English accent
I have a reason to continue.

You have no power here, there's nothing you can do to stop me. Sorry you got upset but it's not my fault.

February 18, 2018 - 11:08

Parker Sullivan

dont talk to me or my wifes son ever again

February 18, 2018 - 11:10

John Jones

>I have a reason to continue.

id like someone to just pop open notepad bang out a few lines of python and BOOM! we got our audio ripper!

Im shocked noone has done that yet but figured out video which on magnitudes is much more difficult.

February 18, 2018 - 11:12

Jayden Sanchez

I'm sure most of the people doing it will still fuck it up, like the idiots plastering celeb faces onto porn actresses that aren't remotely their measurements.

February 18, 2018 - 12:21

Juan Gutierrez

I mean Vocaloid is kind of this, no? They just do't publish how they come up with the product.

February 18, 2018 - 12:21

Mason Martin

The literal state of Cred Forums.

Not to mention the autist who just learned how to run python who made this thread.

February 18, 2018 - 12:27

Brody Martinez

i thought they just hire a VA for that stuff?

February 18, 2018 - 12:27

Nicholas Torres

>Not to mention the autist who just learned how to run python who made this thread.
Clearly he is a master level Python in coding since he ran a GUI of a script.

Never forget that the worlds most elite hackers came from Cred Forums hitting the LOIC button.

Watch out guys we got a super hacker here!

February 18, 2018 - 12:29

Ryan Baker

Peaches, I love peaches

February 18, 2018 - 12:30

Jaxon Hall

Do y'all'd've think that every language has the same consonants?

February 18, 2018 - 12:31

Owen Miller

It's based off a person but the speech is obviously synthesized.

February 18, 2018 - 12:34

Anthony Myers

I am waiting for them to reproduce Frank Mullers voice, who narrated the first 4 audiobook of the dark tower, but died in a motorcycle accident and couldn't finish. There is plenty of data from the audiobook he read to make a decent voice I think, but one still must account for ingestion and emphasis. That is what would make it difficult and I would think that would have to be manually tinker with.

February 18, 2018 - 12:34

Juan Phillips

Inflection *

February 18, 2018 - 12:35

Michael Ortiz

"I'm really into vore" -Frank Muller

February 18, 2018 - 12:36

Isaiah Perez

how is it "obvious"
what if their just adding phase differential or reverb on purpose to a real voice?

February 18, 2018 - 12:40

Julian Phillips

is that true?

February 18, 2018 - 12:41

Nathan Richardson

No. user is saying what could be spoken by Frank muller once what I want is made.

February 18, 2018 - 12:47

Jaxon Ortiz

That is speech synthesis, it ceases to be a voice recording when it's modified like that, the original sample is but the product isn't. I'm not just being pedantic either, the significance here is that the samples are produced not replayed, with the amount of variations, if they were pre-recorded samples the install size would be immense so they have to be produced at runtime.

The other obvious tell is that sometimes they use multiple people to produce a single vocaloid.

I think the one you're replying to is just rolling with the joke kek

February 18, 2018 - 12:48

Brody Allen

It's already made.
It was 2 years ago.

youtube.com/watch?v=I3l4XLZ59iw

February 18, 2018 - 12:49

Aiden Wright

It was a play on the "ingestion" typo.

February 18, 2018 - 12:50

Jason Jenkins

>The other obvious tell is that sometimes they use multiple people to produce a single vocaloid.
interesting, these are things I do not know.

February 18, 2018 - 12:50

Mason White

FUCKING my autocorrect still needs some learning. They need a deepautocorrect before doing deepvoice desu

February 18, 2018 - 12:52

Ryan Evans

I can't find which one it is because there's a lot of them and sometimes they do contracts for other people which are vocaloidS but not part of VOCALOID the product. The opposite is common in their product (1 voice provider for 2 different vocaloids) the most popular example is Len and Rin.

February 18, 2018 - 12:57

Nicholas Hughes

Still doesn't sound natural though. I think if it followed the same machine learning technique by analyzing the dataset from his old reads it would better.

You don't always want to say the same word with the same inflection every time. It depends on the sentence.

February 18, 2018 - 12:59

Ryan Allen

>Still doesn't sound natural though. I think if it followed the same machine learning technique by analyzing the dataset from his old reads it would better.
why cant this be corrected using the same methods we use for animating faces in cartoon and video games? Just use a line simulating attitude that affects expression except in this case inflection.

It would be that easy really when you think about it but noone has figured it out even though Python is available online free.

February 18, 2018 - 13:04

Jose Cooper

I agree. I I have a mockup in my mind for how it might work but I wouldn't know how to actual code it since I am not an audio engineer

February 18, 2018 - 13:23

Andrew Wilson

I been thinking about it like a track layout in adobe with the bar for the attitude adjustment?

what are you thinking?

February 18, 2018 - 13:32

Gavin Sanchez

Basically that. Different filters for adjusting the voice in various ways to tweak it just right.

February 18, 2018 - 13:53

Sebastian Russell

true......

Been trying to figure this out logically. Maybe like how to determine bold type have 7 word phrases read indicating different inflections until the various emotions can be heard?

February 18, 2018 - 13:59

1 2 ... 5 Next

If english language only has 44 sounds. How long until we can fakeaudio people...

Last threads