tag :: deepmind | BibSonomy

Bookmarks (8)

DeepMind papers at ICLR 2018 | DeepMind
https://deepmind.com/blog/deepmind-papers-iclr-2018/
7 years ago by @achakraborty
Tags: 2018, collection, conference, deep-learning, deepmind, iclr, paper, reinforcement-learning, robotics

Rohan Silva on why London is the best place for AI tech | London Evening Standard
Next time you’re at King’s Cross station, take a moment to think about this. Just yards from where you’re standing, the world’s most advanced artificial intelligence (AI) technology is being developed — by a London company called DeepMind.
7 years ago by @achakraborty
Tags: article, artificial-intelligence, blog, deep-learning, deepmind, uk

AlphaGo Zero: Learning from scratch | DeepMind
We introduce AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go. Zero is even more powerful and is arguably the strongest Go player in history. Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play. In doing so, it quickly surpassed human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0.
8 years ago by @achakraborty
Tags: article, deepmind, google, reinforcement-learning

AlphaGo Zero: Learning from scratch | DeepMind
https://deepmind.com/blog/alphago-zero-learning-scratch/
8 years ago by @albinzehe
Tags: alphago, deeplearning, deepmind, neuralnet, reinforcementlearning

Deep Reinforcement Learning | DeepMind
Humans excel at solving a wide variety of challenging problems, from low-level motor control through to high-level cognitive tasks. Our goal at DeepMind is to create artificial agents that can achieve a similar level of performance and generality. Like a human, our agents learn for themselves to achieve successful strategies that lead to the greatest long-term rewards.
8 years ago by @achakraborty
Tags: article, blog, deep-learning, deepmind, google, reinforcement-learning

Decoupled Neural Interfaces Using Synthetic Gradients | DeepMind
Neural networks are the workhorse of many of the algorithms developed at DeepMind. For example, AlphaGo uses convolutional neural networks to evaluate board positions in the game of Go and DQN and Deep Reinforcement Learning algorithms use neural networks to choose actions to play at super-human level on video games. This post introduces some of our latest research in progressing the capabilities and training procedures of neural networks called Decoupled Neural Interfaces using Synthetic Gradients. This work gives us a way to allow neural networks to communicate, to learn to send messages between themselves, in a decoupled, scalable manner paving the way for multiple neural networks to communicate with each other or improving the long term temporal dependency of recurrent networks.
8 years ago by @achakraborty
Tags: article, blog, deep-learning, deepmind, google, gradient-descent

"But watching AlphaGo, I am not sure that's how it thinks of the game"
Human gaming tactics draw analogies from the physical world to hide the underlying complexity (chunking), and enable the players to think at a higher level. AlphaGo isn't limited(?) by physical world analogies.
9 years ago by @continued
Tags: AI, AlphaGo, Bilder, DeepMind, GO, Google, HN, HackerNews, Lee_Sedol, Twitter, YC, lang:en, media:image

Lee Sedol defeats AlphaGo in masterful comeback - Game 4
Excellent game commentary by Go Game Guru
9 years ago by @continued
Tags: AI, AlphaGo, DeepMind, Go, Google, Lee_Sedol, lang:en

Publications (4)

WaveNet: A Generative Model for Raw Audio
A. Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner, A. Senior, and K. Kavukcuoglu. (2016). arXiv:1609.03499.
6 years ago by @vsathish
Tags: cnn, deeplearning, deepmind, wavenet

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel, and 1 other author. Science, 362 (6419): 1140-1144 (2018)
7 years ago by @loroch
Tags: AlphaZero, RL, chess, deep_learning, deepmind, go, shogi

Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Y. Zhu, Z. Wang, J. Merel, A. Rusu, T. Erez, S. Cabi, S. Tunyasuvunakool, J. Kramár, R. Hadsell, N. de Freitas, and 1 other author. (2018). arXiv:1802.09564. Comment: 13 pages, 6 figures.
7 years ago by @achakraborty
Tags: 2018, arxiv, deepmind, imitation-learning, reinforcement-learning, robotics, stanford

Mastering the game of Go without human knowledge
D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, and 7 other authors. Nature (October 2017)
8 years ago by @achakraborty
Tags: 2017, deep-learning, deepmind, google, paper, reinforcement-learning
