Deep learning papers reading roadmap (github.com/songrotek)
421 points by zerotosixty on Oct 21, 2016 | hide | past | favorite | 25 comments


Missing on the list: http://neuralnetworksanddeeplearning.com/

Great book for learning concepts and for getting a generic overview (but goes deep enough that you can jump straight into implementation if you want). I recommend it highly.
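Nielsen's book builds a small network in plain NumPy; a minimal sketch of that idea (the 784-30-10 sigmoid architecture from his MNIST chapter, here with random untrained weights as placeholders, so the names and shapes are my assumptions, not his exact code):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

# Hypothetical 784-30-10 network, the shape used in the book's MNIST
# example; these weights are random stand-ins, not trained values.
sizes = [784, 30, 10]
weights = [rng.standard_normal((m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [rng.standard_normal((m, 1)) for m in sizes[1:]]

def feedforward(a):
    # One sigmoid layer at a time, in the spirit of the book's network.py.
    for w, b in zip(weights, biases):
        a = sigmoid(w @ a + b)
    return a

x = rng.standard_normal((784, 1))   # stand-in for one flattened 28x28 image
out = feedforward(x)
print(out.shape)  # one activation per digit class
```

Training (backprop plus stochastic gradient descent) is what the rest of the book walks through.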


# This downloads all the links in that page

# Just save README.md to the folder of your choice

  sed -ne 's/.*\(http[^")]*\).*/\1/p' < README.md | xargs wget -U 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:49.0) Gecko/20100101 Firefox/49.0'


Turns out, there are also a bunch of extensionless files in there, too.

  for file in *; do if [ -f "$file" ] && [ "${file:(-3)}" != "pdf" ]; then mv "$file" "${file}.pdf"; fi; done
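If the shell quoting is a headache, the link-extraction half of the pipeline can be sketched in Python instead (a hypothetical stand-in: same pattern idea as the sed expression, with the actual download still left to wget or urllib):

```python
import re

def extract_links(markdown):
    # Grab each http... link up to a quote, closing paren, or whitespace,
    # mirroring the sed pattern above.
    return re.findall(r'http[^")\s]*', markdown)

# Toy input standing in for README.md's contents.
sample = '[paper](http://example.com/foo.pdf) and [page](http://example.com/bar)'
links = extract_links(sample)
print(links)
```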


Nice list, but there are too many papers like this and it is easy to get stuck in theory. I would suggest grabbing the code of some simple neural network (Darknet is great for that) and reading that first. If something does not make sense, find the theory in the papers.


This approach is dangerous in statistics and machine learning. Things often work on your toy examples, but they may work for totally different reasons than you think, and that can bite you later. Since it looks to you like it works, you will never dig into the necessary theory. With a 'theory first' approach you might not get nice results as fast, but I would bet that your understanding would be much deeper.

I am not saying that you need to read all the papers before coding a line, but coding your way into it will give you shallower knowledge if you don't follow up with the theory.


Agreed; I'd actually recommend against this. You should know the math and reasoning behind the specific algorithm rather than dissecting it via code.


In fact, there is no single correct answer; it depends on your current state of knowledge and the needs of your project. It also changes over time.

I would be careful about claiming that a theoretical approach always gives you better understanding.


I agree with the posts above. Diving straight into 'practice' in statistics (and other fields, cryptography being the most notorious) leaves you open to a great many pitfalls. Best case, you will merely be inefficient in your approaches.

Your examples may work, with a couple of test sets giving you high confidence, and then you use it in the wild and everything falls apart.

At the same time, machine learning is to a large extent about data cleaning, bootstrapping, picking the right algorithm, and minimizing your iteration cycle as much as possible, which you don't learn until you actually mess around and get your hands dirty. Plus there are little implementation tidbits specific to each project.


That might work for some simpler machine learning algorithms, but in deep learning I think you'll have an even harder time correctly figuring out intuitively what's going on than you would learning the math.


These papers are not very theoretical at all. You can get lucky if everything you need has been implemented in your library of choice, but most deep learning papers are highly practical engineering-driven affairs and brushing them off as unnecessary theory is just doing yourself a disservice.


A bit off topic but what does one need to know/read before doing the Udacity course on autonomous driving?


You don't really have to know anything. And the current free course does not have anything to do with neural nets. They use Monte Carlo methods, Kalman filters, particle filters, PID controllers, etc. You should feel comfortable with probability theory, and if you know some linear algebra, the Kalman filter example will make a lot more sense.
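For a feel of the Kalman filter before the linear-algebra version, here is a toy one-dimensional sketch (my own example, not course material): estimating a constant from noisy measurements, where the gain decides how far to trust each new reading.

```python
import random

random.seed(1)
true_value = 5.0
x, p = 0.0, 1000.0   # state estimate and its variance (very uncertain prior)
r = 0.5 ** 2         # measurement noise variance (assumed known)

for _ in range(50):
    z = true_value + random.gauss(0.0, 0.5)  # noisy measurement
    k = p / (p + r)        # Kalman gain: how much to trust the measurement
    x = x + k * (z - x)    # pull the estimate toward the measurement
    p = (1 - k) * p        # uncertainty shrinks after each update

print(x)  # should end up close to 5.0
```

The full course version adds a motion model and works in more dimensions, which is where the matrices come in.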


What are the best video lectures out there, with emphasis on theory (not coding)?


I can highly recommend Hinton's course on Coursera - https://www.coursera.org/learn/neural-networks

It starts on October 31st. I took it a while ago and learned a lot; the course gravitates towards the theoretical end of the spectrum. If you want something that's a bit easier to get into, I can recommend the course by Hugo Larochelle:

http://info.usherbrooke.ca/hlarochelle/neural_networks/conte...


I recommend this course: CS231n Winter 2016 Stanford (Lecturer is Andrej Karpathy) https://www.youtube.com/playlist?list=PLLvH2FwAQhnpj1WEB-jHm...


I quite enjoyed Nando de Freitas' Oxford lecture: https://www.youtube.com/watch?v=PlhFWT7vAEw


In addition to cs231n, which someone recommended, I would also suggest cs224d (Stanford) and the University of Waterloo's STAT 946 by Ali Ghodsi.


Great, thanks!

I find the actual problem is how to select what to study given a time constraint: if you had 5 hours, or 20, or 200, what should you read?

An exhaustive list is great, but it's really an optimization problem: how do I maximise understanding subject to a time constraint? Which resources do I select to maximise learning in x hours/days/years?


  > which resources do I select to maximise learning in x 
  > hours/days/years?

Pick a particular open-source implementation of something you're interested in, say the Google image captioning technology, read the papers necessary to understand that, and play with that implementation.

The main problem with the plan implied by the OP's bibliography is that it's not practical at all. You'll end up with a catholic knowledge of the theory and no idea about how to use it.


Is there a link to the book mentioned in item 1 (1.0 Book) as a PDF? The closest I could find is http://www.deeplearningbook.org/ which says I cannot get a PDF of the book, though clearly one existed at some point.



Yes, thank you. I didn't understand that the [pdf] was a separate link from the broken webpage link.


I don't think it's clear that there was ever an official PDF provided. There was some discussion of making a PDF from the HTML in a previous thread dedicated to that book. https://news.ycombinator.com/item?id=11446164


I was SOOO looking for something like this!


I would skim these to select something basic and then try to experiment with a real system with those documents as a guide/reference.

Try to actually learn one level at a time.

That is my approach right now. I have read a lot, but without being 100% on the fundamentals most of it goes over my head, so I am backing up. I plan to try the Tensorflow examples as well, but I expect it will be pretty shaky until my high-level practical knowledge can meet my fundamentals in the middle, if I can keep progressing.



