VideoPoet

VideoPoet
	File:Dog popcorn with audio 6781BD0C.webm "一隻狗在電影院裏吃爆米花" File:Drums with audio DA80C510.webm "一隻戴着帽子、太陽眼鏡和皮夾克的泰迪熊正在打鼓" 由該模型生成的範例影片來自於文字。
開發者	Google
首次發佈	2024年2月8日，2年前
目前版本	Module:EditAtWikidata第29行Lua錯誤：attempt to index field 'wikibase' (a nil value)
原始碼庫	{{URL\|example.com\|可选的显示文本}}; Module:EditAtWikidata第29行Lua錯誤：attempt to index field 'wikibase' (a nil value)
引擎	Module:EditAtWikidata第29行Lua錯誤：attempt to index field 'wikibase' (a nil value)
類型	大型語言模型
許可協定	Module:EditAtWikidata第29行Lua錯誤：attempt to index field 'wikibase' (a nil value)

VideoPoet是由 Google Research 於 2023 年開發的一款大型語言模型，主要用於影片製作。^[1]^[2]^[3]^[4]該模型能將靜態影像轉換為動畫。^[5] VideoPoet 支援文字、影像和影片作為輸入，並能將這些輸入轉換成多種格式。^[4]該模型於 2023 年 12 月 19 日正式公開。^[1]VideoPoet 使用自我迴歸模型。

參考資料[編輯]

^ ^1.0 ^1.1 Krithika, K. L. Google Unveils VideoPoet, a New LLM for Video Generation. Analytics India Magazine. 2023-12-20 [2024-04-29]. （原始內容存檔於2024-05-21）（en-US）.
^ Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu. VideoPoet: A Large Language Model for Zero-Shot Video Generation. 2023-12-21. arXiv:2312.14125 可免費查閱 [cs.CV].
^ Google has introduced VideoPOET breaking new ground in coherent video generation. Gizmochina. 2023-12-21 [2024-08-28]. （原始內容存檔於2024-03-06）.
^ ^4.0 ^4.1 VideoPoet. Google Research. [2024-04-29]. （原始內容存檔於2025-01-30）（English）.
^ Franzen, Carl. Google’s new multimodal AI video generator VideoPoet looks incredible. VentureBeat. 2023-12-20 [2024-08-28]. （原始內容存檔於2025-01-14）.

外部連結[編輯]

File:Commons-logo.svg 維基共享資源上的相關多媒體資源：Module:Commons_link第64行Lua錯誤：attempt to index field 'wikibase' (a nil value)

小作品圖示

這是一篇關於Google的小作品。您可以透過編輯或修訂擴充其內容。

小作品圖示

這是一篇人工智能相關小作品。您可以透過編輯或修訂擴充其內容。

[:1-1] 1.0 ^1.1 Krithika, K. L. Google Unveils VideoPoet, a New LLM for Video Generation. Analytics India Magazine. 2023-12-20 [2024-04-29]. （原始內容存檔於2024-05-21）（en-US）.

[2] Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu. VideoPoet: A Large Language Model for Zero-Shot Video Generation. 2023-12-21. arXiv:2312.14125 可免費查閱 [cs.CV].

[3] Google has introduced VideoPOET breaking new ground in coherent video generation. Gizmochina. 2023-12-21 [2024-08-28]. （原始內容存檔於2024-03-06）.

[:0-4] 4.0 ^4.1 VideoPoet. Google Research. [2024-04-29]. （原始內容存檔於2025-01-30）（English）.

[5] Franzen, Carl. Google’s new multimodal AI video generator VideoPoet looks incredible. VentureBeat. 2023-12-20 [2024-08-28]. （原始內容存檔於2025-01-14）.

[1]

[2]

[3]

[4]

[5]

VideoPoet

參考資料[編輯]

外部連結[編輯]

導覽菜單

搜尋