resented_ape t1_j3kiocc wrote on January 9, 2023 at 4:38 AM

Have you looked at other related papers? In particular, hypergradient descent and papers that cite that work (which you can find via google scholar)?

LahmacunBear OP t1_j3l5ub2 wrote on January 9, 2023 at 8:54 AM

Oh damn, that paper almost does exactly what I do. Huh. Oh well. Slightly different implementation though. I in contrast, use both grads from the same timestep and have an accumulated Ct.

SatoshiNotMe t1_j3lhi10 wrote on January 9, 2023 at 11:34 AM

Are either of these open source and easily usable as a PyTorch optimizer ?

[deleted] t1_j3lvzjy wrote on January 9, 2023 at 1:55 PM

[removed]

LahmacunBear OP t1_j3mrexi wrote on January 9, 2023 at 5:27 PM

Mine’s in Tensorflow 2.11 — I’m sure writing a PyTorch version wouldn’t be hard. The extra lines of the algorithm are three lines in my paper. I can share my code though?

SatoshiNotMe t1_j3n5p3v wrote on January 9, 2023 at 6:54 PM

Thanks! I was just curious for future reference. I’ll need to first read the papers to see if it can help with my projects.