
Artificial Intelligence & Machine Learning

Discuss peer review challenges in AI/ML research — submission, review quality, bias, and decision appeals at ICLR, ICML, NeurIPS, AAAI, IJCAI, AISTATS and COLT.

This category can be followed from the open social web via the handle ai-ml@forum.cspaper.org

53 Topics 187 Posts
  • 1 Votes
    3 Posts
    443 Views
    riverR
I want to add a bit of my reflection on AI review.

Potential Strengths
- Scalability and Efficiency: AI systems could assist in managing the ever-growing number of submissions, reducing workload for human reviewers and accelerating review timelines.
- Consistency and Standardization: Automated systems can enforce uniform criteria, potentially reducing variance caused by subjective or inconsistent human judgment.
- Augmented Support for Humans: AI could provide structured summaries, highlight methodological issues, or retrieve related prior work, acting as a co-pilot rather than a replacement for human reviewers.
- Transparency and Traceability: With criterion-aligned or structured outputs, AI systems might make explicit how particular aspects of a paper were evaluated, offering traceability that complements human interpretation.

Concerns and Limitations
- Quality and Depth of Judgment: Peer review is not just about summarization or surface-level critique. Human reviewers often contribute domain expertise, intuition, and contextual reasoning that AI currently struggles to replicate.
- Evaluation Metrics Misalignment: Overlap-based metrics (e.g., ROUGE, BERTScore) may not fully capture the nuanced quality of reviews, which often rely on critical reasoning and qualitative assessment (see the sketch after this post).
- Dataset and Generalizability Issues: Many experiments in this space rely on small or narrow datasets (e.g., limited to certain conferences), which risks overfitting and reduces generalizability to other domains.
- Reproducibility and Fairness: Reliance on proprietary large language models introduces cost, access, and reproducibility challenges. Comparisons across different model sizes or modalities can also create fairness concerns.
- Multimodality and Context Handling: While AI can parse text and visuals, questions remain about whether figures, tables, and extended contexts truly require specialized handling beyond what modern large-context models can already process.

Ethical and Practical Considerations
- Human Replacement vs. Human Augmentation: A key concern is whether AI should replace reviewers or assist them. Many argue for augmentation rather than substitution, especially given the subjective and community-driven nature of peer review.
- Bias and Trust: AI-generated reviews may inherit biases from training data or evaluation frameworks, raising questions about fairness and transparency in decision-making.
- Cost and Sustainability: Running AI review systems at scale may incur significant computational and financial costs, particularly when leveraging closed, high-capacity models.
- Accountability: Unlike human reviewers, AI systems cannot be held accountable for their judgments, which complicates trust and governance in academic publishing.

Emerging Attitudes
- Skepticism: Many scholars remain unconvinced that AI can capture the essence of peer review, viewing it as reductionist or superficial.
- Cautious Optimism: Some see AI as a promising assistant to support human reviewers, especially for summarization, consistency checks, or initial screening.
- Call for Rigor: There is a consensus that human evaluation, broader benchmarking, and careful methodological design are critical before integrating AI into the peer review process at scale.

In summary: The use of AI in peer review is seen as an intriguing and potentially useful tool for augmentation, but concerns around motivation, evaluation validity, fairness, and the irreplaceable role of human judgment dominate current attitudes. There is strong agreement that more rigorous evidence and careful deployment strategies are needed before AI can play a central role in scholarly reviewing.
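A quick illustration of the metrics-misalignment point above, as a minimal sketch (not from the original post): a ROUGE-1-style unigram-overlap F1 computed on two invented one-sentence reviews. The example texts and function name are hypothetical; the point is that high lexical overlap can coexist with an opposite overall judgment, so overlap scores alone say little about a review's critical reasoning.

```python
# Minimal sketch: a ROUGE-1-style unigram-overlap F1 between a human review
# and an AI-generated review. The two example sentences below are invented
# placeholders, not real reviews.
from collections import Counter


def rouge1_f1(reference: str, candidate: str) -> float:
    """F1 over clipped unigram overlap (a simplified ROUGE-1)."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())
    overlap = sum((ref_counts & cand_counts).values())  # clipped counts
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


human_review = "the method is novel but the evaluation lacks a strong baseline comparison"
ai_review = "the method is novel and the evaluation is strong with baseline comparison"

# High lexical overlap even though the AI review inverts the human criticism.
print(f"ROUGE-1 F1: {rouge1_f1(human_review, ai_review):.2f}")
```

Running this prints a ROUGE-1 F1 of 0.75 even though the two reviews disagree about the paper's main weakness, which is exactly the misalignment worry.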
  • 1 Votes
    4 Posts
    766 Views
    riverR
Looks like the AAAI-26 rollercoaster isn’t slowing down anytime soon. Both of the latest notices confirm what many of us have been feeling: the review timeline is slipping yet again. Originally, Phase 1 notifications were due Sept 8. Then the OpenReview crash pushed it to Sept 12. Now the official word is Sept 15.

- August 4, 2025: Supplementary material and code due by 11:59 PM UTC-12
- September 15, 2025: Notification of Phase 1 rejections
- October 7–13, 2025: Author feedback window
- November 8, 2025: Notification of final acceptance or rejection (Main Technical Track)

That three-day “extra wait” might not sound like much, but for authors it’s brutal. It means:
- Suspended in mid-air: checking inbox + OpenReview every morning, still no word.
- Rebuttal delayed: the author response window got pushed past the October holidays, which makes sense only if reviews themselves aren’t ready.
- Compressed transfer time: folks planning to bounce to ICLR 2026 if rejected are losing precious prep days. With the ICLR abstract due Sept 19 and the full paper Sept 24, every delay cuts deep.

To add spice, the program chairs hinted the Phase 1 rejection rate could hit 50–67%, meaning as few as ~1/3 of submissions would survive past the first cut. With nearly 29,000 papers in the system, more than double last year, the scale is unprecedented.

The bigger picture:
- Emergency “last-minute reviewers” are being pulled in to cover gaps.
- Other conferences are also bending: NeurIPS’s “dual-city” experiment saw accepted papers later force-rejected due to quota caps.
- The pattern is clear: our current peer review model is hitting a breaking point. Technical crashes, reviewer overload, rebuttals turning into vent sessions are all signs of strain.

Open questions for us as a community:
- Do we just accept longer waits and higher rejection odds as the new normal?
- Should AAAI (and other big A* conferences) move toward dynamic, rolling review models rather than single-shot deadlines?
- Or do we need to rethink reciprocal review obligations more fundamentally, to balance load without roulette-style assignments?

For now, all we can do is hang tight until Sept 15 (no more extensions). But honestly, given the trajectory, I wouldn’t be surprised if “Sept 15” becomes “Sept whenever.” Anyone here already prepping ICLR as a fallback? Or are you holding out for the rebuttal round?
  • 0 Votes
    2 Posts
    4k Views
    L
    Any source on this one? AAAI has not yet released anything...
  • 1 Votes
    2 Posts
    346 Views
    N
Is CSPaper (https://review.cspaper.org) collaborating with AAAI's new AI review initiative?
  • 0 Votes
    2 Posts
    638 Views
    rootR
It is said that over 22,000 valid submissions are entering the review phase.
  • 2 Votes
    12 Posts
    3k Views
    R
A guide sent today to reviewers (PC members):

Dear XXXX,

As a member of the Program Committee, you play a critical role in the success of the AAAI-26 conference. Now that we’ve released Phase 1 paper assignments, we rely on reviewers like you to carefully read papers and provide an informed review of the strengths and weaknesses of the papers. Such reviews are then used to decide which papers will pass into Phase 2 of the review process. Reviews for Phase 1 are due on Monday, September 1 (anywhere on earth). If you anticipate any problems meeting this deadline, please notify the SPC for this paper by creating an “official comment” and setting “readers” to the SPC in OpenReview ASAP. You can find more detailed information about the entire review process in the Instructions For AAAI 2026 Reviewers.

The AAAI-26 paper matching process takes into account many inputs, including paper keywords, your OpenReview profile, your DBLP record, your paper bids, etc. In some cases, the papers you are matched with may not be a perfect fit for your expertise. Because AAAI-26 is a very large conference that encompasses many types of AI, please do your best. If you wish to pull in an additional colleague to help you review the paper, please check with your SPC first (see instructions above on how to create an official comment). If you do use a subreviewer, please remember that you are responsible for all content in the review, and you should still take part in the Phase 2 discussion (if the paper proceeds to Phase 2).

For those of you who have not reviewed for AAAI recently, or just want some additional reviewing tips, please check out the “Guidelines On Writing Helpful Reviews” (also included in the Instructions For AAAI 2026 Reviewers).

As we go through the paper review process, please ensure you are familiar with AAAI-26’s ethics policies listed in the Ethical Guidelines for AAAI-26 Reviewers. If you have any concerns regarding ethics, such as authors or reviewers breaking anonymity, reviewers using LLMs, inappropriate pressure to change ratings, etc., you may report them to our ethics chairs through this Google Form. Potential ethics violations will be investigated, now or in the future, by the AAAI-26 Ethics Chairs and/or the AAAI standing Ethics Committee. The chairs or committee will impose penalties depending on the severity of the infraction. We must emphasize that there is no time limit on when ethics violations can be investigated and sanctions imposed.

Thank you again for all your work in making AAAI-26 a success! (If you have any questions or suggestions, please continue to contact us via workflowchairs26@aaai.zendesk.com.)

Sincerely,
The AAAI-26 Program Chairs
  • 3 Votes
    9 Posts
    8k Views
    rootR
For the main track, authors have a final chance to add their remarks:

Dear Author(s),

To facilitate the authors’ response to any final questions and comments from the reviewers, we have decided to provide an “Author Final Remarks” response mechanism. This 2500-character non-editable field is optional, and allows authors to convey any final pressing comments relevant to the rebuttal or discussions. It is primarily intended for the AC’s consideration during decision-making, but it is also visible to reviewers to support internal discussions. Please note that reviewers, ACs, and SACs will not be able to respond to the remarks in this field: the mechanism is intended as a final response providing closure to Author-Reviewer discussions.

While the hard deadline is Aug 15 AoE, we recommend that authors convey any final remarks, responses, and conclusions they may have as soon as possible to ensure the AC has sufficient opportunity to take them into account.

Sincerely,
Program Chairs: Nancy Chen, Marzyeh Ghassemi, Piotr Koniusz, Razvan Pascanu, Hsuan-Tien Lin
Assistant Program Chairs: Elena Burceanu, Junhao Dong, Zhengyuan Liu, Po-Yi Lu, Isha Puri
  • 1 Votes
    2 Posts
    247 Views
    R
Love the insights about "What Counts as Good Stitching?"!
  • 1 Votes
    2 Posts
    313 Views
    cqsyfC
Researchers and scholars who are Chinese (or have a Chinese educational background) are definitely dominating AI/ML research, I think.
  • 1 Votes
    2 Posts
    403 Views
    rootR
thx @Joanne for sharing this. CSPaper Review has many known problems awaiting fixes. We are committed to fixing them in a timely manner!