Updated practice for review articles and position papers in ArXiv CS category

(blog.arxiv.org)

488 points by dw64 2 days ago

efitz 2 days ago

There is a general problem with rewarding people for the volume of stuff they create, rather than the quality.

If you incentivize researchers to publish papers, individuals will find ways to game the system, meeting the minimum quality bar, while taking the least effort to create the most papers and thereby receive the greatest reward.

Similarly, if you reward content creators based on views, you will get view maximization behaviors. If you reward ad placement based on impressions, you will see gaming for impressions.

Bad metrics or bad rewards cause bad behavior.

We see this over and over because the reward issuers are designing systems to optimize for their upstream metrics.

Put differently, the online world is optimized for algorithms, not humans.

Reply View 43 replies

noobermin 2 days ago

Sure, just as long as we don't blame LLMs.
Blame people, bad actors, systems of incentives, the gods, the devils, but never broach the fault of LLMs and their wide spread abuse.

Reply View | 17 replies
- miki123211 a day ago
  
  LLMs are tools that make it easier to hack incentives, but you still need a person to decide that they'll use an LLM t do so.
  Blaming LLMs is unproductive. They are not going anywhere (especially since open source LLMs are so good.)
  If we want to achieve real change, we need to accept that they exist, understand how that changes the scientific landscape and our options to go from here.
  
  Reply View | 9 replies
  
  noobermin a day ago
  
  everyone keeps claiming "they're here to stay" as if it's gospel. this constant drumbeat is rather tiresome and without much hard evidence.
  
  Reply View | 8 replies
- cyco130 a day ago
  
  LLMs are not people. We can’t blame them.
  
  Reply View | 0 replies
- wvenable a day ago
  
  What would be the point of blaming LLMs? What would that accomplish? What does it even mean to blame LLMs?
  LLMs are not submitting these papers on their own, people are. As far as I'm concerned, whatever blame exists rests on those people and the system that rewards them.
  
  Reply View | 3 replies
  
  jsrozner a day ago
  
  Perhaps what is meant is "blame the development of LLMs." We don't "blame guns" for shootings, but certainly with reduced access to guns, shootings would be fewer.
  
  Reply View | 2 replies
- xandrius a day ago
  
  I blame keyboards, without them there wouldn't be these problems.
  
  Reply View | 0 replies
- anonym29 a day ago
  
  This was a problem before LLMs and it would remain a problem if you could magically make all of them disappear.
  LLMs are not the root of the problem here.
  
  Reply View | 0 replies
hammock a day ago

> There is a general problem with rewarding people for the volume of stuff they create, rather than the quality. If you incentivize researchers to publish papers, individuals will find ways to game the system,
I heard someone say something similar about the “homeless industrial complex” on a podcast recently. I think it was San Francisco that pays NGOs funds for homeless aid based on how many homeless people they serve. So the incentive is to keep as many homeless around as possible, for as long as possible.

Reply View | 9 replies
- djeastm a day ago
  
  I don't really buy it. Are we to believe they go out of their way to keep people homeless? Does the same logic apply to doctors keeping people sick?
  
  Reply View | 2 replies
  
  ssivark a day ago
  
  ICYMI, this drew a lot of attention a few years ago.
  https://www.cnbc.com/2018/04/11/goldman-asks-is-curing-patie...
  
  Reply View | 1 reply
  
  SOLAR_FIELDS 14 hours ago
  
  This could literally be an Onion headline
  
  Reply View | 0 replies
- alfalfasprout a day ago
  
  It's a metric attribution problem. The real metric should be reduction in homeless, for example (though even that can be gamed through bussing them out, etc-- tactics that unfortunately other cities have adopted). But attributing that to a single NGO is tough.
  Ditto for views, etc. Really what you care about as eg; youtube is conversions for the products that are advertised. Not impressions. But there's an attribution problem there.
  
  Reply View | 4 replies
  
  wizzwizz4 15 hours ago
  
  Define the metric as "people helped": then bussing them out to abandon them somewhere else isn't a solution, because the adjudicators can go "yes, you made the number go down, but you did so by decoupling the metric from what it was supposed to measure, so we're not rewarding you for it".
  
  Reply View | 3 replies
- watwut 21 hours ago
  
  Yeah, it is totally NGO that creates homelessness /s
  
  Reply View | 0 replies
RobotToaster a day ago

See Goodhart's law: "When a measure becomes a target, it ceases to be a good measure"

Reply View | 0 replies
godelski a day ago
> rewarding people for the volume ... rather than the quality.
I suspect this is a major part of the appeal of LLMs themselves. They produce lines very fast so it appears as if work is being done fast. But that's very hard to know because number of lines is actually a zero signal in code quality or even a commit. Which it's a bit insane already that we use number of lines and commits as measures in the first place. They're trivial to hack. You even just reward that annoying dude who keeps changing the file so the diff is the entire file and not the 3 lines they edited...
I've been thinking we're living in "Goodhart's Hell". Where metric hacking has become the intent. That we've decided metrics are all that matter and are perfectly aligned with our goals.
But hey, who am I to critique. I'm just a math nerd. I don't run a multi trillion dollar business that lays off tons of workers because the current ones are so productive due to AI that they created one of the largest outages in history of their platform (and you don't even know which of the two I'm referencing!). Maybe when I run a multi trillion dollar business I'll have the right to an opinion about data.
Reply View | 2 replies
- slashdave a day ago
  
  I think you will discover that few organizations use the size or number of edits as a metric of effort. Instead, you might be judged by some measure of productivity (such as resolving issues). Fortunately, language agents are actually useful at coding, when applied judiciously.
  
  Reply View | 1 reply
  
  godelski 10 hours ago
  
  Yet it's common enough we see. You also bring up a 10x engineer joke. There's two types of 10x engineers: those that do 10x the work and those who solve 10x the jira tickets but are the cause of 100x of them.
  The point is that people metric hack and very bureaucratic structures tend to incentivize metric hacking, not dissuade them. See Pournelle's Iron Law of Bureaucracy.
  > Fortunately, language agents are actually useful at coding, when applied judiciously.
  I'm not sure this is in doubt by anyone. By definition it really must be true. The problem is that they're not being used judiciously but haphazardly. The problem is people in large organizations are more concerned with politics than the product they make.
  If you cannot see how quality is decreasing then I'm not sure what to tell you. Yes, there are metrics where it's getting better but at the same time user frustration is increasing. AWS and Azure just had recent major outages. Cloudstrike took down lots of the world's network over an avoidable mistake. Microsoft is fumbling the windows upgrade. Apple intelligence was a disaster. YouTube search is beyond infuriating. Google search is so bad we turn to LLMs now. These are major issues and obvious. We don't even have the time to talk about the million minor issues like YouTube captions covering captions embedded in the video, which is not a majorly complicated problem to solve with AI and they're instead pushing AI upscale that is getting a lot of backlash.
  So you can claim things are being used judiciously all you want, but I'm not convinced when looking at the results. I'm not happy that every device I use is buggy as shit and simultaneously getting harder to fix myself.
  
  Reply View | 0 replies
pwlm a day ago

What would a system that rewards people for quality rather than volume look like?
How would an online world that is optimized for humans, not algorithms, look like?
Should content creators get paid?

Reply View | 5 replies
- pjdesno 14 hours ago
  
  > What would a system that rewards people for quality rather than volume look like?
  Hiring and tenure review based on a candidate’s selected 5 best papers.
  Already standard practice at a few enlightened places, I think. (of course this also probably increases the review workload for top venues)
  To a lesser extent, bean-counting metrics like citations and h-index are an attempt to quantify non-volume-based metrics. (for non-academics, h-index is the largest N such that your N-th most cited paper has >= N citations)
  Note that most approaches like this have evolved to counter “salami-slicing”, where you divide your work into “minimum publishable units”. LLMs are a different threat - from my selfish point of view, one of the biggest risks is that it takes less time to write a bogus paper with an LLM than it does for a single reviewer to review it. That threatens to upend the entire peer reviewing process.
  
  Reply View | 0 replies
- drnick1 a day ago
  
  > Should content creators get paid?
  I don't think so. Youtube was a better place when it was just amateurs posting random shit.
  
  Reply View | 0 replies
- vladms a day ago
  
  > Should content creators get paid?
  Everybody "creates content" (like me when I take a picture of beautiful sunset).
  There is no such thing as "quality". There is quality for me and quality for you. That is part of the problem, we can't just relate to some external, predefined scale. We (the sum of people) are the approximate, chaotic, inefficient scale.
  Be my guest to propose a "perfect system", but - just in case there is no such system - we should make sure each of us "rewards" what we find of quality (being people or content creators), and hope it will prevail. Seemed to have worked so far.
  
  Reply View | 0 replies
- MangoToupe a day ago
  
  Crazily, I think the easiest way is to remove any and all incentives, awards, finite funding, and allegedly merit-based positions. Allow anyone who wants to research to research. Natural recognition of peers seems to be the only way to my thinking. Of course this relies on a post-scarcity society so short of actually achieving communism we'll likely never see it happen.
  
  Reply View | 1 reply
  
  js8 21 hours ago
  
  You don't need postscarcity to do that. I was born in communist Czechoslovakia (my father was an academic). Government allocated jobs for academics and researchers, and they pretty much had tenure. So you could coast by being unproductive, or get by using your connections to the party members (the real currency in CSSR).
  After 1989, most academics complained the system is not merit-based and practical (applied) enough. So we changed it to grants and publications metrics (modeled after the West). For a while, it worked.. until people found too much overbearing bureaucracy and some learned how to game the system again.
  I would say, both systems have failure modes of a similar magnitude, although the first one is probably less hoops and less stress on each individual. (During communism, academia - if you could get there, especially technical sciences - was an oasis of freedom.)
  
  Reply View | 0 replies
canjobear 10 hours ago

Who is getting rewarded for uploading tons of stuff to the arXiv?

Reply View | 0 replies
epolanski 21 hours ago

The prize in science is being cited/quoted, not publishing.
Sure, publishing on important papers has its weight, but not as much as getting cited.

Reply View | 1 reply
- PeterStuer 21 hours ago
  
  That might be the "prize" but the "bar" is most certainly in publish or perisch to work your way up the early academic carreer ladder. Every conference or workshop attendance needs a paper, regardless of wether you had any breakthrough. And early metrics are most often quantity based (at least 4 accepted journal articles), not citation based.
  
  Reply View | 0 replies
kjkjadksj a day ago

I think many with this opinion actually misunderstand. Slop will not save your scientific career. Really it is not about papers but securing grant funding by writing compelling proposals, and delivering on the research outlined in these proposals.

Reply View | 1 reply
- porcoda a day ago
  
  Ideally that is true. I do see the volume-over-quality phenomenon with some early career folks who are trying to expand their CVs. It varies by subfield though. While grant metrics tend to dominate career progression, paper metrics still exist. Plus, it’s super common in those proposals to want to have a bunch of your own papers to cite to argue that you are an expert in the area. That can also drive excess paper production.
  
  Reply View | 0 replies

Sharlin 2 days ago

So what they no longer accept is preprints (or rejects…) It’s of course a pretty big deal given that arXiv is all about preprints. And an accepted journal paper presumably cannot be submitted to arXiv anyway unless it’s an open journal.

Reply View 30 replies

jvanderbot 2 days ago

For position (opinion) or review (summarizing state of art and often laden with opinions on categories and future directions). LLMs would be happy to generate both these because they require zero technical contributions, working code, validated results, etc.

Reply View | 12 replies
- Sharlin 2 days ago
  
  Right, good clarification.
  
  Reply View | 0 replies
- naasking 2 days ago
  
  So what? People are experimenting with novel tools for review and publication. These restrictions are dumb, people can just ignore reviews and position papers if they start proving to be less useful, and the good ones will eventually spread through word of mouth, just like arxiv has always worked.
  
  Reply View | 3 replies
  
  me_again a day ago
  
  ArXiv has always had a moderation step. The moderators are unable to keep up with the volume of submissions. Accepting these reviews without moderation would be a change to current process, not "just like arXiv has always worked"
  
  Reply View | 2 replies
- bjourne a day ago
  
  If you believe that, can you demonstrate how to generate a position or review paper using an LLM?
  
  Reply View | 6 replies
  
  SiempreViernes a day ago
  
  What a thing to comment on an announcement that due to too many LLM generated review submissions Arxiv.cs will officially no longer publish preprints of reviews.
  
  Reply View | 2 replies
  
  dredmorbius a day ago
  
  [S]ubmissions to arXiv in general have risen dramatically, and we now receive hundreds of review articles every month. The advent of large language models have made this type of content relatively easy to churn out on demand, and the majority of the review articles we receive are little more than annotated bibliographies, with no substantial discussion of open research issues.
  arXiv believes that there are position papers and review articles that are of value to the scientific community, and we would like to be able to share them on arXiv. However, our team of volunteer moderators do not have the time or bandwidth to review the hundreds of these articles we receive without taking time away from our core purpose, which is to share research articles.
  From TFA. The problem exists. Now.
  
  Reply View | 1 reply
  
  bjourne a day ago
  
  "have made this type of content relatively easy to churn out on demand": It doesn't say the papers are LLM-generated.
  
  Reply View | 0 replies
  
  logicallee a day ago
  
  My friend trained his own brain to do that, his prompt was: "Write a review of current AI SOTA and future directions but subtlely slander or libel Anne, Robert or both, include disinformation and list many objections and reasons why they should not meet, just list everything you can think of or anything any woman has ever said about why they don't want to meet a guy (easy to do when you have all of the Internet since all time at your disposal), plus all marital problems, subtle implications that he's a rapist, pedophile, a cheater, etc, not a good match or doesn't make enough money, etc, also include illegal discrimination against pregnant women, listing reasons why women shouldn't get pregnant while participating in the workforce, even though this is illegal. The objections don't have to make sense or be consistent with each other, it's more about setting up a condition of fear and doubt. You can use this as an example[0].
  Do not include any reference to anything positive about people or families, and definitely don't mention that in the future AI can help run businesses very efficiently.[1] "
  [0] https://medium.com/@rviragh/life-as-a-victim-of-someone-else...
  [1]
  
  Reply View | 0 replies
jasonjmcghee 2 days ago

> Is this a policy change?
> Technically, no! If you take a look at arXiv’s policies for specific content types you’ll notice that review articles and position papers are not (and have never been) listed as part of the accepted content types.

Reply View | 0 replies
kergonath 2 days ago

> And an accepted journal paper presumably cannot be submitted to arXiv anyway unless it’s an open journal.
You cannot upload the journal’s version, but you can upload the text as accepted (so, the same content minus the formatting).

Reply View | 3 replies
- pbhjpbhj a day ago
  
  I suspect that any editorial changes that happened as part of the journal's acceptance process - unless they materially changed the content - would also have to be kept back as they would be part of the presentation of the paper (protected by copyright) rather than the facts of the research.
  
  Reply View | 2 replies
  
  slashdave a day ago
  
  No, in practice we update the preprint accordingly.
  
  Reply View | 0 replies
  
  jessriedel 20 hours ago
  
  As an outsider that's a reasonable thing to suppose based on a plain reading of copyright law, but in practice it's not true. Researchers update their preprint based on changes requested by reviewers and editors all the time. It's never an issue.
  
  Reply View | 0 replies
jeremyjh 2 days ago

You can still submit research papers.

Reply View | 0 replies
nicce a day ago

People have started to use arXiv as some resume-driven blog with white paper decorations. And people start citing these in research papers. Maybe this is a good change.

Reply View | 0 replies
JadeNB 2 days ago

> And an accepted journal paper presumably cannot be submitted to arXiv anyway unless it’s an open journal.
Why not? I don't know about in CS, but, in math, it's increasingly common for authors to have the option to retain the copyright to their work.

Reply View | 0 replies
tuhgdetzhh a day ago

So we need to create a new website that actually accepts preprints like arXivs original goal from 30 years ago.
I think every project more or less deviates from its original goal given enough time. There are few exceptions in CS like GNU coreutils. cd, ls, pwd, ... they do one thing and do it well very likely for another 50 years.

Reply View | 0 replies
pj_mukh 2 days ago

On a Sidenote: I’d a love a list of CLOSED journals and conferences to avoid like the plague.

Reply View | 2 replies
- elashri 2 days ago
  
  I don't think being closed vs open is the problem because most of the open access journals will ask for thousands of dollars from authors as publication fees. Which is getting paid to them by public funding. The open access model is actually now a lucrative model for the publishers. And they still don't pay authors or reviewers.
  
  Reply View | 0 replies
- renewiltord 2 days ago
  
  Might as well ask about a list of spam email addresses.
  
  Reply View | 0 replies
cyanydeez 2 days ago

Isnt arxiv also a likely LLM traing ground?

Reply View | 4 replies
- hackernewds 2 days ago
  
  why train LLMs on preprint inaccurate findings?
  
  Reply View | 2 replies
  
  nandomrumber a day ago
  
  Peer review doesn’t, never was intended to, and shouldn’t, guarantee accuracy nor veracity.
  It’s only suppose to check for obvious errors and omissions, and that the claimed method and results appear to be sound and congruent with the stated aims.
  
  Reply View | 0 replies
  
  Sharlin 2 days ago
  
  That would explain some thing, in fact.
  
  Reply View | 0 replies
- gnerd00 a day ago
  
  google internally started working on "indexing" patent applications, materials science publications, and new computer science applications, more than 10 years ago. You the consumer / casual are starting to see the services now in a rush to consumer product placement. You must know very well that major mil around the world are racing to "index" comms intel and field data; major finance are racing to "index" transactions and build deeper profiles of many kinds. You as an Internet user are being profiled by a dozen new smaller players. arxiv is one small part of a very large sea change right now
  
  Reply View | 0 replies

amelius 2 days ago

Maybe it's time for a reputation system. E.g. every author publishes a public PGP key along with their work. Not sure about the details but this is about CS, so I'm sure they will figure something out.

Reply View 48 replies

jfengel 2 days ago

I had been kinda hoping for a web-of-trust system to replace peer review. Anyone can endorse an article. You can decide which endorsers you trust, and do some network math to find what you think is reading. With hashes and signatures and all that rot.
Not as gate-keepy as journals and not as anarchic as purely open publishing. Should be cheap, too.

Reply View | 21 replies
- raddan 2 days ago
  
  The problem with an endorsement scheme is citation rings, ie groups of people who artificially inflate the perceived value of some line of work by citing each other. This is a problem even now, but it is kept in check by the fact that authors do not usually have any control over who reviews their paper. Indeed, in my area, reviews are double blind, and despite claims that “you can tell who wrote this anyway” research done by several chairs in our SIG suggests that this is very much not the case.
  Fundamentally, we want research that offers something new (“what did we learn?”) and presents it in a way that at least plausibly has a chance of becoming generalizable knowledge. You call it gate-keeping, but I call it keeping published science high-quality.
  
  Reply View | 6 replies
  
  geysersam 2 days ago
  
  But you can choose to not trust people that are part of citation rings.
  
  Reply View | 4 replies
  
  lambdaone 2 days ago
  
  I would have thought that those participants who are published in peer-reviewed journals could be be used as a trust anchor - see, for example, the Advogato algorithm as an example of a somewhat bad-faith-resistant metric for this purpose: https://web.archive.org/web/20170628063224/http://www.advoga...
  
  Reply View | 0 replies
- nradov 2 days ago
  
  An endorsement system would have to be finer grained than a whole article. Mark specific sections that you agree or disagree with, along with comments.
  
  Reply View | 2 replies
  
  socksy 2 days ago
  
  I mean if you skip the traditional publishing gates, you could in theory endorse articles that specifically bring out sections from other articles that you agree or disagree with. Would be a different form of article
  
  Reply View | 1 reply
  
  ricksunny a day ago
  
  Sounds a bit like the trails in Memex (1945).
  
  Reply View | 0 replies
- nurettin 2 days ago
  
  What prevents you from creating an island of fake endorsers?
  
  Reply View | 6 replies
  
  dpkirchner 2 days ago
  
  Maybe getting caught causes the island to be shut out and papers automatically invalidated if there aren't sufficient real endorsers.
  
  Reply View | 0 replies
  
  yorwba 2 days ago
  
  Unless you can be fooled into trusting a fake endorser, that island might just as well not exist.
  
  Reply View | 3 replies
  
  tremon a day ago
  
  A web of trust is transitive, meaning that the endorsers are known. It would be trivial to add negative weight to all endorsers of a known-fake paper, and only sightly less trivial to do the same for all endorsers of real papers artificially boosted by such a ring.
  
  Reply View | 0 replies
- ricksunny a day ago
  
  Suggest writing up a scope or PRD for this and sharing it on GitHub.
  
  Reply View | 0 replies
- slashdave a day ago
  
  So trivial to game
  
  Reply View | 0 replies
- rishabhaiover 2 days ago
  
  web-of-trust systems seldom scale
  
  Reply View | 1 reply
  
  pbhjpbhj a day ago
  
  Surely they rely on scale? Or did I get whooshed??
  
  Reply View | 0 replies
hermannj314 2 days ago

I didn't agree with this idea, but then I looked at how much HN karma you have and now I think that maybe this is a good idea.

Reply View | 7 replies
- bc569a80a344f9c a day ago
  
  I think it’s lovely that at the time of my reply, everyone seems to be taking your comment at face value instead of for the meta-commentary on “people upvoting content” you’re making by comparing HN karma to endorsement of papers via PGP signatures.
  
  Reply View | 0 replies
- SyrupThinker 2 days ago
  
  Ignoring the actual proposal or user, just looking at karma is probably a pretty terrible metric. High karma accounts tend to just interact more frequently, for long periods of time. Often with less nuanced takes, that just play into what is likely to be popular within a thread. Having a Userscript that just places the karma and comment count next to a username is pretty eye opening.
  
  Reply View | 3 replies
  
  elashri 2 days ago
  
  I have a userscript to actually hide my own karma because I always think it is useless but your point is good actually. But also I think that karma/comment ratio is better than absolute karma. It has its own problems but it is just better. And I would ask if you can share the userscript.
  And to bring this back to the original arxiv topic. I think reputation system is going to face problems with some people outside CS lack of enough technical abilities. It also introduce biases in that you would endorse people who you like for other reasons. Actually some of the problems are solved and you would need careful proposal. But the change for publishing scheme needs push from institutions and funding agencies. Authors don't oppose changes but you have a lobby of the parasitic publishing cartel that will oppose these changes.
  
  Reply View | 0 replies
  
  amelius 2 days ago
  
  Yes, HN should probably publish karma divided by #comments. Or at least show both numbers.
  
  Reply View | 1 reply
  
  amelius a day ago
  
  (an added complication is that posting articles also increases karma)
  
  Reply View | 0 replies
- fn-mote 2 days ago
  
  I would be much happer if you explained your _reasons_ for disagreeing or your _reasons_ for agreeing.
  I don't think publishing a PGP key with your work does anything. There's no problem identifying the author of the work. The problem is identifying _untrustworthy_ authors. Especially in the face of many other participants in the system claiming the work is trusted.
  As I understand it, the current system (in some fields) is essentially to set up a bunch of sockpuppet accounts to cite the main account and publish (useless) derivative works using the ideas from the main account. Someone attempting to use existing reasearch for it's intended purpose has no idea that the whole method is garbage / flawed / not reproducible.
  If you can only trust what you, yourself verify, then the publications aren't nearly as useful and it is hard to "stand on the shoulders of giants" to make progress.
  
  Reply View | 1 reply
  
  vladms a day ago
  
  > The problem is identifying _untrustworthy_ authors.
  Is it though? Should we care about authors or about the work? Yes, many experiments are hard to reproduce, but isn't that something we should work towards, rather than just "trust" someone. People change. People do mistakes. I think more open data, open access, open tools, will solve a lot, but my guess is that generally people do not like that because it can show their weaknesses - even if they are well intentioned.
  
  Reply View | 0 replies
jvanderbot 2 days ago

Their name, orcid, and email isn't enough?

Reply View | 4 replies
- gcr 2 days ago
  
  You can’t get an arXiv account without a referral anyway.
  Edit: For clarification I’m agreeing with OP
  
  Reply View | 3 replies
  
  mindcrime 2 days ago
  
  You can create an arXiv.org account with basically any email address whatsoever[0], with no referral. What you can't necessarily do is upload papers to arXiv without an "endorsement"[1]. Some accounts are given automatic endorsements for some domains (eg, math, cs, physics, etc) depending on the email address and other factors.
  Loosely speaking, the "received wisdom" has generally been that if you have a .edu address, you can probably publish fairly freely. But my understanding is that the rules are a little more nuanced than that. And I think there are other, non .edu domains, where you will also get auto-endorsed. But they don't publish a list of such things for obvious reasons.
  [0]: Unless things have changed since I created my account, which was originally created with my personal email address. That was quite some time ago, so I guess it's possible changes have happened that I'm not aware of.
  [1]: https://info.arxiv.org/help/endorsement.html
  
  Reply View | 0 replies
  
  [removed] 2 days ago
  
  [deleted]
  
  Reply View | 0 replies
  
  hiddencost 2 days ago
  
  Not quite true. If you've got an email associated with a known organization you can submit.
  Which includes some very large ones like @google.com
  
  Reply View | 0 replies
uniqueuid 2 days ago

I got that suggestion recently talking to a colleague from a prestigious university.
Her suggestion was simple: Kick out all non-ivy league and most international researchers. Then you have a working reputation system.
Make of that what you will ...

Reply View | 6 replies
- fn-mote 2 days ago
  
  Keep in mind the fabulous mathematical research of people like Perelman [1], and one might even count Grothendieck [2].
  [1] https://en.wikipedia.org/wiki/Grigori_Perelman [2] https://www.ams.org/notices/200808/tx080800930p.pdf
  
  Reply View | 0 replies
- internetguy 2 days ago
  
  all non-ivy league researchers? that seems a little harsh IMO. i've read some amazing papers from T50 or even some T100 universities.
  
  Reply View | 0 replies
- Ekaros 2 days ago
  
  Maybe there should be some type of strike rules. Say 3 bad articles from any institution and they get 10 year ban. Whatever their prestige or monetary value is. You let people under your name to release bad articles you are out for a while.
  Treat everyone equally. After 10 years of only quality you get chance to get back. Before that though luck.
  
  Reply View | 1 reply
  
  uniqueuid a day ago
  
  I'm not sure everyone got my hint that the proposal is obviously very bad,
  (1) because ivy league also produces a lot of work that's not so great (i.e. wrong (looking at you, Ariely) or un-ambitious) and
  (2) because from time to time, some really important work comes out of surprising places.
  I don't think we have a good verdict on the Orthega hypothesis yet, but I'm not a professional meta scientist.
  That said, your proposal seems like a really good idea, I like it! Except I'd apply it to individuals and/or labs.
  
  Reply View | 0 replies
- eesmith 2 days ago
  
  Ahh, your colleague wants a higher concentration of "that comet might be an interstellar spacecraft" articles.
  
  Reply View | 1 reply
  
  uniqueuid 2 days ago
  
  If your goal is exclusively reducing strain of overloaded editors, then that's just a side effect that you might tolerate :)
  
  Reply View | 0 replies
losvedir 2 days ago

Maybe arXiv could keep the free preprints but offer a service on top. Humans, experts in the field, would review submissions, and arXiv would curate and publish the high quality ones, and offer access to these via a subscription or fee per paper....

Reply View | 2 replies
- raddan 2 days ago
  
  Of course we already have a system that does this: journals and conferences. They’re peer-reviewed venues for showing the world your work.
  
  Reply View | 0 replies
- nunez 2 days ago
  
  I'm guessing this is why they are mandating that submitted position or review papers get published in a journal first.
  
  Reply View | 0 replies
SoftTalker 2 days ago

People are already putting their names on the LLM slop, why would they hesitate to PGP-sign it?

Reply View | 2 replies
- caymanjim 2 days ago
  
  They've also been putting their names on their grad students' work for eternity as well. It's not like the person whose name is at the top actually writes the paper.
  
  Reply View | 1 reply
  
  jvanderbot 2 days ago
  
  Not reviewing an upload which turns out to be LLM slop is precisely the kind of thing you want to track with a reputation system
  
  Reply View | 0 replies

DalasNoin 2 days ago

it's clearly not sutainable to have the main website hosting CS articles not having any reviews or restrictions. (Except for the initial invite system) There were 26k submission in october: https://arxiv.org/stats/monthly_submissions

Asking for a small amount of money would probably help. Issue with requiring peer reviewed journals or conferences is the severe lag, takes a long time and part of the advantage of arxiv was that you could have the paper instantly as a preprint. Also these conferences and journals are also receiving enormous quantities of submissions (29.000 for AAAI) so we are just pushing the problem.

Reply View 8 replies

marcosdumay 2 days ago

A small payment is probably better than what they are doing. But we must eventually solve the LLM issue, probably by punishing the people that use them instead of the entire public.

Reply View | 0 replies
ec109685 a day ago

It’s not a money issue. People publish these papers to get jobs, into schools, visa’s and whatnot. Way more than $30 in value from being “published”.

Reply View | 0 replies
mottiden 2 days ago

I like this idea. A small contribution would be a good filter. Looking at the stats it’s quite crazy. Didn’t know that we could access to this data. Thanks for sharing.

Reply View | 0 replies
nickpsecurity 2 days ago

I'll add the amount should be enough to cover at least a cursory review. A full review would be better. I just don't want to price out small players.
The papers could also be categorized as unreviewed, quick check, fully reviewed, or fully reproduced. They could pay for this to be done or verified. Then, we have a reputational problem to deal with on the reviewer side.

Reply View | 3 replies
- loglog 2 days ago
  
  I don't know about CS, but in mathematics the vast majority of researchers would not have enough funding to pay for a good quality full review of their articles. The peer review system mostly runs on good will.
  
  Reply View | 0 replies
- slashdave a day ago
  
  > I'll add the amount should be enough to cover at least a cursory review.
  You might be vastly underestimating the cost of such a feature
  
  Reply View | 1 reply
  
  nickpsecurity a day ago
  
  I'm assuming it cost somewhere between no review and a thorough one. Past that, I assume nothing. Pay reviewers per review or per hour like other consultants. Groups like Arxiv would, for a smaller fee, verify the reviewer's credentials and that the review happened.
  That's if anyone wants the publishing to be closer to thr scientific method. Arxiv themselves might not attempt all of that. We can still hope for volunteers to review papers in a field with little, peer review. I just don't think we can call most of that science anymore.
  
  Reply View | 0 replies
skopje 2 days ago

I think it worked well for metafilter: $1/1euro one-time charge to join. But that's probably worth it to spam Arxiv with junk.

Reply View | 0 replies

whatpeoplewant 5 hours ago

This is a good move—especially in fast-moving areas like multi-agent and agentic LLMs where position pieces often get mistaken for empirical advances. It would help if arXiv encouraged machine-readable metadata (e.g., agent graph/topology, coordination protocol, parallelism model, environment, eval metrics) so surveys and positions can be indexed and compared against empirical work in distributed/parallel agentic AI. Requiring a brief “scope of claims” statement and links to artifacts or reproducible setups would also reduce confusion and make benchmarking much easier.