Tag: visual

📄

AIの10年で芸術はどう変わった？アーティストたちの本音 — Artists on a Decade of AI Evolution: An Interview Study of Affordances, Culture, and Artistic Practice with Machine Learning

@June 18, 2024 9:30 PM (GMT+2)

. . "Artists on a Decade of AI Evolution: An Interview Study of Affordances, Culture, and Artistic Practice with Machine Learning."

Paper

artethicsdesignhuman-aivisualNLPsocietycross-modal

March 25, 2026 12:50 PM (GMT+9)

📄

絵画を知らないAIが絵画を生成できるか — Art-free Diffusion

2024

アートを含まない学習データを学習したAIモデルをベースに、少数のアート作品の画像でLoRAを学習。きちんとそのアーティストの特徴を掴んだ画像が生成された。

Ren, Hui, Joanna Materzynska, Rohit Gandikota, David Bau, and Antonio Torralba. 2024. “Art-Free Generative Models: Art Creation Without Graphic Art Knowledge.” http://arxiv.org/abs/2412.00176.

Paper

visualtheoryethicsimage

December 9, 2024 1:59 PM (GMT+9)

👨‍👩‍👦

もしAIが「へのへのもへじ」を作ったら？ — CLIPと進化戦略を用いたコラージュ画像の生成

2021

画像とテキストがどれくらいマッチしているかを定量化するCLIPモデルを用いて、要素画像の配置を最適化。入力されたテキストにあったコラージュ画像を生成するシステム

CLIP-guided collage image optimization using Evolutionary Strategy

Project

visualcross-modal

December 11, 2021

👨‍👩‍👦

Botto—コミュニティのフィードバックに基づいてNFTアートを自動生成するBot

2021

CLIP+VQ-GANの仕組みを活用

Botto Project

Project

artvisualGAN

November 19, 2021

📄

Visual indeterminacy in GAN art

2020

GANが生成する画像の「●●ぽいけど、なんか違う...」という「不確定性」に着目し、現代アートの特徴との比較を行った上で、今後のGANアートの将来像を探る。

Hertzmann, A. (2020) ‘Visual indeterminacy in GAN art’, Leonardo. MIT Press Journals, 53(4), pp. 424–428.

Paper

arttheoryGANvisual

May 19, 2021

📄

Generating Long Sequences with Sparse Transformers

2019

スパースなTransformerの仕組みで計算量を抑える

Child, R. et al. (2019) ‘Generating Long Sequences with Sparse Transformers’, arXiv. arXiv. Available at: http://arxiv.org/abs/1904.10509 (Accessed: 29 January 2021).

Paper

musicvisual

May 16, 2021

💽

ArtEmis: Affective Language for Visual Art

2021

8万枚の絵画にクラウドソーシングで44万の言語情報を付加。

ArtEmis: Affective Language for Visual Art

Dataset

visualart

April 22, 2021

⚒️

CinemaNet

普通の画像認識モデルのようなオブジェクトの識別に加えて、カメラのアングルやフォーカスの当て方(ソフトフォーカス...)、撮影された時間帯(夕方、朝焼け)、場所などをタグ付け

CinemaNet by Anton Marini(vade), Rahul Somani

Tool

visual

March 3, 2021

AIを用いたAudio Visual – Stylizing Audio Reactive Visuals

2019

Han-Hung Lee, Da-Gin Wu, and Hwann-Tzong Chen, "Stylizing Audio Reactive Visuals", NeurlPS2019, (2019)

Paper

visualGAN

June 24, 2020

📄

様々なメディアのフレームを補間する – Depth-Aware Video Frame Interpolation

2020

様々なメディアのフレームを補間する – Depth-Aware Video Frame Interpolation

Paper

imagevisual

February 4, 2020

📄

音と映像の関係性の学習 – Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

2018

Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Paper

soundvisual

May 20, 2018

📄

動画からそれにあった音を生成 – Visual to Sound: Generating Natural Sound for Videos in the Wild

2018

Visual to Sound: Generating Natural Sound for Videos in the Wild

Paper

soundvisual

January 3, 2018

📄

画像から、好みのメッシュの3Dモデルを作成する -Neural 3D Mesh Renderer-

2017

Neural 3D Mesh Renderer

Paper

visualimage

November 25, 2017

💾

人の行動の動画データセット – AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

2017

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

Dataset

visualimage

October 23, 2017

👨‍👩‍👦

機械とともに描くポートレート – Delusions

2017

Delusions

demo

performancevisualimage

October 20, 2017

👨‍👩‍👦

顔写真から3Dモデルを生成 – Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression

2017

Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression

demo

artvisualimage

September 29, 2017

💾

動植物の画像データセット – The iNaturalist Challenge 2017 Dataset

2017

The iNaturalist Challenge 2017 Dataset

Dataset

visual

July 23, 2017

📄

過去の作品を学習することで本当に新しい作品が作れるのか?? – CAN: Creative Adversarial Networks Generating “Art” by Learning About Styles and Deviating from Style Norms

2017

CAN: Creative Adversarial Networks Generating “Art” by Learning About Styles and Deviating from Style Norms

Paper

GANartvisual

June 29, 2017

📄

人工知能の力を借り、3Dモデルを共作する – Interactive 3D Modeling with a Generative Adversarial Network –

2017

Interactive 3D Modeling with a Generative Adversarial Network

demo

visualGAN

June 25, 2017

📄

画像⇆音の生成 – Deep Cross-Modal Audio-Visual GenerationDeep Cross-Modal Audio-Visual Generation

2017

Deep Cross-Modal Audio-Visual Generation

Paper

visualsound

May 14, 2017

💾

車載カメラ画像データセット – Mapillary Vistas Dataset

2017

Mapillary Vistas Dataset

Dataset

visual

May 4, 2017

📄

適切なフォントの組み合わせを生成 – Fontjoy

2017

適切なフォントの組み合わせを生成 – Fontjoy

demo

visual

April 30, 2017

👨‍👩‍👦

未来を予測して動画を生成 – Generating Videos with Scene Dynamics –

2017

Generating Videos with Scene Dynamics

Project

visualimage

April 30, 2017

📄

一枚の写真からその後の人の動きを予測 – Forecasting Human Dynamics from Static Images

2017

Forecasting Human Dynamics from Static Images

Paper

visualimageperformance

April 25, 2017

📄

横顔から正面から見た顔を生成 – Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

2017

Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis

Paper

GANvisual

April 18, 2017

👨‍👩‍👦

機械学習を用いたドラムマシン – The Infinite Drum Machine : Thousands of everyday sounds, organized using machine learning.

2017

The Infinite Drum Machine : Thousands of everyday sounds, organized using machine learning

Project

musicvisualsound

April 7, 2017

📄

見えない体を見る. 一人称視点の映像からカメラをつけている人の姿勢を推定. – Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video

2017

Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video

Paper

visualimage

April 6, 2017

📄

Attributesによる画像の美しさ判定 – Photo Aesthetics Ranking Network with Attributes and Content Adaptation

2017

Photo Aesthetics Ranking Network with Attributes and Content Adaptation

Paper

visualimage

April 4, 2017

📄

CycleGAN 対訳がなくても画像を翻訳(変換) – Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

2017

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

Project

visualimage

April 1, 2017

📄

ストリートビューの画像の解析による人口統計調査 – Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US

2017

Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US

Paper

visualimage

March 31, 2017

📄

ファッション・トレンドの解析. 東京は… – Changing Fashion Cultures

2017

Changing Fashion Cultures

Paper

visualart

March 29, 2017

📄

写真のStyle Transfer- Deep Photo Style Transfer

2017

Deep Photo Style Transfer

Paper

visualimage

March 25, 2017

👨‍👩‍👦

目が回ります – DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation

2017

DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation

demo

visual

March 19, 2017

📄

DeepDreamを用いたのドローイングツール- DreamCanvas

2017

DeepDreamを用いたのドローイングツール- DreamCanvas

demo

visual

March 15, 2017

📄

フォントのStyle Transfer? – Awesome Typography: Statistics-Based Text Effects Transfer

2017

YANG, Shuai, et al. "Awesome typography: Statistics-based text effects transfer", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.7464-7473, (2017)

Paper

visualimage

February 5, 2017

📄

AENet: Learning Deep Audio Features for Video Analysis

2017

AENet: Learning Deep Audio Features for Video Analysis

Paper

visualmusic

January 20, 2017

👨‍👩‍👦

T-SNE MAP – Google Arts and Culture Experiments

2016

T-SNE MAP – Google Arts and Culture Experiments

Project

performancevisual

January 13, 2017

📄

Unsupervised Learning of 3D Structure from Images

2016

Unsupervised Learning of 3D Structure from Images

Paper

visualimage

December 6, 2016

Name	Rows
1	About
2	Facebook
3	Twitter
4	Qosmo
5	Keio SFC Computational Creativity Lab

Tag: visual

Tag: visual

💾
References: AI

Footer

Tag: visual

Tag: visual

💾References: AI

Footer

💾
References: AI