Comment by ekjhgkejhgk

Comment by ekjhgkejhgk 11 hours ago

0 replies

Correct, that's [2]. In [2] they even say "[we] derive de main result using the approach first proposed in " and cite [1]. So the paper that everyone knows, in English (and with Bengio), explictly say that the original idea is in a paper in German, and still the scientific community chose not to cite the German original.

[1] https://people.idsia.ch/~juergen/SeppHochreiter1991ThesisAdv...

[2] https://sferics.idsia.ch/pub/juergen/gradientflow.pdf