Putting Words In Ronald Reagan's Mouth

Would you rather talk to Angelina Jolie than Siri? What if Ronald Reagan delivered the RNC keynote address? A new technology that converts text to video could make it possible.

By filming people speaking a variety of phrases, Seyyer can begin to build a video avatar.

“Form is henceforth divorced from matter,” Oliver Wendell Holmes wrote of photography in 1859. I was reminded of Holmes’s sweeping pronouncement in talking with Behrooz Rezvani, the CEO of the curious, and more than a tad eerie, Seyyer. In the future, Rezvani explains, we may each have a video presence that is distinct from our physical one. Read on to learn about a panacea for overbearing parents, the ghost of Ronald Reagan, and Angelina Jolie's possible future as a replacement for Siri.

FAST COMPANY: What’s the idea behind Seyyer?

Behrooz Rezvani

BEHROOZ REZVANI: The genesis of the whole thing is the idea to convert text to video. In order to convert text to video for a particular person, and have that person be animated saying certain things, you have to learn a lot about the way that person talks, and their facial expressions.

So you mean, I would write an email or text, and it would show up for my recipient as my face talking.

That’s the ultimate goal. If the recipient has a model of you on their phone, then when they receive the text, they can actually see you talking on it.

Why would you want that?

In many countries around the world, there’s a kind of bandwidth starvation. What if you want to share a magical moment with grandma, with her grandkids texting from far away distances? Also, texting is now more popular than calling someone. A change really took place around 2008 or 2009.

I always feel guilty, because my grandmother doesn't use email or receive texts, so we can only communicate by phone, but I never call anyone anymore.

To me that happened because my son would not answer my calls, but he would respond to my texts. That was my “Gee” moment. I thought, what does it take to actually hear his voice or see his face talking to me?

So the idea for your company comes from your son refusing to take your calls? This is stereotypically a Jewish mother problem.

There was some epiphany around both kind of topics. Getting forced to use SMS by my son, and also the lack of bandwidth in the developing world.

So the idea is: if we can't have the bandwidth for FaceTime, at least we'll simulate it.

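Right. Once the idea began to take shape, I realized there were a lot of other things that would become possible: texting, e-books, Twitter. Imagine people's tweets coming to life, so Anderson Cooper tweets from somewhere, and suddenly his picture pops up on Twitter.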

“Applications include video texting, books read by their author, or by your favorite actor. Angelina Jolie could be the one setting up your schedule, instead of Siri.”

Of course, you can't record an infinite number of videos with Anderson Cooper to turn anything he might write into video. So how does your tech work?

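For full control, obviously you can't record everything; it becomes obscene. So we develop models. We record enough of a person's visual expressions that we can work with a range of emotions and expressions. Then we do the same for the audio: the person talks for a certain amount of time, and we record that and build a model. For vocabulary and expressions that don't exist in the recordings, we can interpolate them from past history.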

At this point let’s have readers take a look at this video you recently released demonstrating your technology:

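When I first saw this, I thought you had just taken a 22-second clip of Reagan and fiddled with the mouth. Are you doing more than just changing his mouth here?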

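Absolutely. We change the whole face. Changes in the mouth are all connected to the rest of the face. And it's more than one 22-second set of data. We looked at about 20 minutes of total video to extract the model.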

In behind-the-scenes features on Pixar movies, you see “wireframe” builds of the animation. Is there a wireframe here?

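There is a wireframe, but not in the way you're familiar with from cartoons. You have a large number of possible variations in how the wireframe might move. For example, when the mouth opens, the cheeks may be in a different position depending on the expression.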

So in essence, the image of Reagan is a kind of model or puppet that you can manipulate in all sorts of ways.

Right. We’re not modifying it–we are generating the face from scratch.

Is this brand-new technology?

To our knowledge, and we have talked to a lot of experts, this is the first time. It's actually very complex and difficult.

Why’d you choose to demo on Reagan? Isn’t it partisan, and also kind of creepy?

I'll take all the credit and all the blame.

You should have reached out to the RNC and told them Reagan could have given the keynote.

We don't want to side with any one party in particular.

How will you commercialize the technology in the near future?

One of the most interesting things for us is the advertising space. To a lot of advertising experts, personalization and video are important. If you have a brand–take for example the T-Mobile girl. It takes a lot of time and money to get these actors in front of the camera to shoot them. Now if you want to update the message about the brand, it may be expensive and impractical to get these guys in front of the camera again. So the question was, can we do this one-time shoot and generate any message dynamically?

The T-Mobile girl's agent is screaming in agony right now.

No, I think the T-Mobile girl would be very happy. She can continue to monetize her image, and her agent gets a cut, too.

But licensing your likeness for infinite permutations–that’s kind of scary.

All these things are negotiated. I think they would not allow–I’m just making it up–using her image more than once a week, or not more than two times a quarter.

So do you have any interested advertisers or brands?

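I've had some exciting conversations in the past week. I don't know where they will go, but there are some really big names. In the next few months, we may be able to come up with an exciting first application.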

Where do you imagine this technology in five to ten years?

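I firmly believe text-to-video will be a major pillar, but I don't know whether it will blossom within five to ten years. But applications include video texting, books read by their author, or by your favorite actor. Angelina Jolie could be the one setting up your schedule, instead of Siri.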

That would be trading up. But wouldn't it dilute Angelina's brand?

I don't know. Suppose someone offers hundreds of millions of dollars...

Returning to the idea of having a text-to-video chat with a family member–won’t there be the issue of the Uncanny Valley?

I think the technology will get to the point where we can't tell the difference.

But isn't it a problem if someone can hijack my mom's likeness and impersonate her?

With every new technology and paradigm, we have to deal with philosophical questions of how to prevent abuse. There are several ways we think we could do this: with a watermark on the audio or video, for instance. There are ways that you can assure that people know the difference, which one is real, and which one is not real.

This interview has been condensed and edited. For more from the Fast Talk interview series, click here. Know someone who would make a good Fast Talk subject? Mention it to David Zax.

About the author

David Zax is a contributor to Fast Company. His writing has appeared in many publications, including Smithsonian, Slate, Wired, and the Wall Street Journal.