Comment by macleginn

Comment by macleginn 2 days ago

So this looks essentially like continuous prompting (see prefix tuning) with RL-driven selection of what to present as tokens and what as continuous inputs (embeddings).