What to know about this new Chinese text-to-video AI model

Kuaishou, the Chinese short-video platform with over 600 million active users, announced the new tool, called Kling, on June 6. Like OpenAI’s Sora model, Kling is able to generate videos “up to two minutes long with a frame rate of 30fps and video resolution up to 1080p,” the company says on its website.

But unlike Sora, which remains inaccessible to the public four months after OpenAI trialed it, Kling soon started letting people try the model themselves.

I was one of them. I got access to it after downloading Kuaishou’s video-editing tool, signing up with a Chinese phone number, getting on a waitlist, and filling out an additional form through Kuaishou’s user feedback groups. The model can’t process prompts written entirely in English, but you can get around that by either translating the phrase you want to use into Chinese or including one or two Chinese words.

So, first things first. Here are a few results I generated with Kling to show you what it’s like. Remember Sora’s impressive demo video of Tokyo’s street scenes or the cat darting through a garden? Here are Kling’s takes:

Remember the image of Dall-E’s horse-riding astronaut? I asked Kling to generate a video version too.

There are a few things worth applauding here. None of these videos deviates from the prompt much, and the physics seem right—the panning of the camera, the ruffling leaves, and the way the horse and astronaut turn, showing Earth behind them. The generation process took around three minutes for each of them. Not the fastest, but totally acceptable.

But there are obvious shortcomings, too. The videos, while 720p in format, seem blurry and grainy; sometimes Kling ignores a major request in the prompt; and most important, all videos generated now are capped at five seconds long, which makes them far less dynamic or complex.

However, it’s not really fair to compare these results with things like Sora’s demos, which are hand-picked by OpenAI to release to the public and probably represent better-than-average results. These Kling videos are from the first attempts I had with each prompt, and I rarely included prompt-engineering keywords like “8k, photorealism” to fine-tune the results.
