[B! LLM][multimodal] arrowKatoのブックマーク

arrowKato id:arrowKato

LLMとmultimodalに関するarrowKatoのブックマーク (1)

Introduction to GPT-4o and GPT-4o mini | OpenAI Cookbook
GPT-4o ("o" for "omni") and GPT-4o mini are natively multimodal models designed to handle a combination of text, audio, and video inputs, and can generate outputs in text, audio, and image formats. GPT-4o mini is the lightweight version of GPT-4o. Background Before GPT-4o, users could interact with ChatGPT using Voice Mode, which operated with three separate models. GPT-4o integrates these capabil
arrowKato 2024/11/19
画像を入力にするときのサンプルコードあり

LLM

GPT-4o

GPT-4o-mini

multimodal
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx