Comment by antiochIst 2 days ago
Currently I'm using Snowflake’s Arctic embedding model on the whole story, not just the title, to cluster stories. There are still some issues, but it's not as simple as looking at the title and publish date.
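To make the clustering idea concrete, here's a rough sketch of grouping stories by embedding similarity. It is not the actual pipeline: the vectors are tiny made-up stand-ins for real Arctic embeddings of the full story text, and `cluster_stories`, the greedy single-pass approach, and the 0.8 threshold are all assumptions picked for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cluster_stories(embeddings, threshold=0.8):
    """Greedy single-pass clustering (hypothetical helper).

    Each story joins the first cluster whose seed story is at least
    `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # each cluster is a list of story indices; index 0 is the seed
    for i, vec in enumerate(embeddings):
        for cluster in clusters:
            if cosine(vec, embeddings[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


# Toy 3-d vectors standing in for whole-story embeddings (made-up values).
toy_embeddings = [
    [0.9, 0.1, 0.0],     # story 0
    [0.85, 0.15, 0.05],  # story 1: near-duplicate coverage of story 0
    [0.0, 0.2, 0.95],    # story 2: unrelated topic
]
clusters = cluster_stories(toy_embeddings)
print(clusters)
```

In a real setup the threshold would need tuning against actual stories, and a proper clustering algorithm would probably behave better than this greedy pass.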
Yeah, I need to do some work on improving "first to publish"... currently I'm relying pretty heavily on the published date provided in the story itself, but sometimes that is wrong, which makes it look like a later publisher was first to publish.
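One possible mitigation, sketched below, is to sanity-check the stated date against the time the story was first crawled. This is only an illustration, not something that exists today: the `Story` fields, `first_seen`, and the "take the earlier of the two" rule are all assumptions. It only catches stated dates that are too late (for example an "updated" timestamp), not ones that are too early.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Story:
    publisher: str
    stated_published: Optional[datetime]  # date claimed in the story itself
    first_seen: datetime                  # hypothetical: when a crawler first fetched it


def effective_time(story):
    """A story can't have been published after it was first crawled,
    so cap the stated date at first_seen; fall back to first_seen if missing."""
    if story.stated_published is None:
        return story.first_seen
    return min(story.stated_published, story.first_seen)


def first_to_publish(cluster):
    """Pick the story with the earliest effective publish time."""
    return min(cluster, key=effective_time)


cluster = [
    # Stated date is later than the crawl time, e.g. an "updated" timestamp.
    Story("A", datetime(2024, 5, 2, 9, 0), datetime(2024, 5, 1, 8, 0)),
    Story("B", datetime(2024, 5, 1, 12, 0), datetime(2024, 5, 1, 12, 30)),
]
winner = first_to_publish(cluster)
print(winner.publisher)
```

Going by stated dates alone, B would look like the first publisher here; capping at the crawl time picks A instead.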