Structuring Research Papers

Academic papers follow a standard structure called IMRaD: Introduction, Methods, Results, and Discussion. This guide teaches you how to write each component effectively for machine learning research.

Introduction (1-1.5 pages)

The introduction should motivate your work and establish the research gap.

Standard Structure

Motivation (1-2 paragraphs)
- Why does this problem matter?
- What are the real-world implications?
- Example: “Emergency department overcrowding leads to delayed care and worse patient outcomes…”
Gap (1 paragraph)
- What’s missing in current work?
- What problem are you solving?
- Example: “Existing models use only structured EHR data, ignoring rich patient-reported information…”
Contributions (1 paragraph)
- What does your work provide?
- What are the key innovations?
- Example: “We propose a multimodal transformer that fuses structured data, text descriptions, and visual inputs…”
Results Preview (1 paragraph)
- Tease main findings
- Example: “Our approach achieves 6% improvement over baseline models on outcome prediction…”

Writing Tips

Start broad, narrow down: Global problem → specific gap → your solution
Hook the reader: First sentence should grab attention
Be concise: Every sentence should have a purpose
Tell a story: Guide the reader through your logic

Organize by themes, not chronologically.

Organization by Theme

Transformers for Healthcare: Healthcare-specific transformer models (ETHOS, BEHRT)
Multimodal AI: Multimodal foundations, CLIP
Prediction Models: Existing outcome prediction systems
Vision-Language Models: Medical imaging + text applications

For Each Paper

What did they do? Brief summary of approach
How is your work different? Contrast with your method
Build a narrative: Connect papers to tell a story

Example Paragraph Structure

“Recent work has applied transformers to structured healthcare data [1,2,3]. ETHOS [1] uses masked event modeling on EHR sequences, achieving state-of-the-art results on mortality prediction. However, these approaches ignore unstructured patient-reported data. Meanwhile, multimodal models have shown promise in general domains [4,5], but haven’t been adapted to healthcare prediction. Our work bridges this gap by…”

Methods (2-3 pages)

The guiding principle: Reproducibility. Another researcher should be able to reproduce your work from this section.

Essential Components

Data
- Dataset description
- Preprocessing steps
- Train/val/test splits
- Example: “We used 50,000 ED visits from Hospital X (2020-2024), split temporally…”
Model Architecture
- Full specification with diagram
- Component details
- Example:


Our multimodal model consists of three encoders:

**EHR Encoder**: 12-layer transformer (d_model=768, n_heads=12)
pre-trained on 8M ED visits using masked event modeling.

**Text Encoder**: ClinicalBERT fine-tuned on symptom descriptions.

**Sketch Encoder**: ResNet-50 pre-trained on ImageNet.

Fusion via cross-attention (8 heads, 512-dimensional output).

Training
- Optimizer and learning rate
- Batch size and number of epochs
- Hardware details
- Example: “Adam optimizer (lr=1e-4), batch size 32, 50 epochs on A100 GPU”
Baselines
- How you implemented comparisons
- Hyperparameter choices
- Example: “ETHOS baseline trained with official implementation…”
Evaluation
- Metrics and why you chose them
- Statistical tests
- Example: “We report AUROC, AUPRC, and calibration. Significance tested with paired t-test.”

Methods Checklist

Data source and size clearly stated
Preprocessing steps documented
Model architecture fully specified
Hyperparameters listed
Training procedure described
Hardware/software environment documented
Baselines clearly defined
Evaluation metrics justified

Results (2-3 pages)

Present findings clearly with tables and figures.

Essential Components

Main Results Table
- Full model vs all baselines
- Multiple metrics
- Confidence intervals

Model	AUROC	AUPRC	Accuracy	F1
Logistic Regression	0.72 ± 0.02	0.45 ± 0.03	0.68	0.55
ETHOS	0.83 ± 0.01	0.61 ± 0.02	0.79	0.71
Our Model (Full)	0.89 ± 0.01	0.72 ± 0.02	0.84	0.79

Ablation Study
- Contribution of each component
- Systematic removal analysis
Qualitative Analysis
- Attention visualizations (attention mechanism)
- Case studies
- Error analysis
Statistical Tests
- P-values for comparisons
- Significance statements
- Example: “Our model significantly outperforms ETHOS (p < 0.001)“

Figure Guidelines

Clear captions: Should be self-contained
Referenced in text: “As shown in Figure 2…”
Color-blind friendly: Use palettes like viridis
Readable fonts: Minimum 10pt
High resolution: At least 300 DPI for print

What to Include/Exclude

Include:

Results that answer your research question
Ablation studies showing component contributions
Error analysis showing where model fails
Comparisons to all baselines

Exclude (save for appendix):

Every single experiment you ran
Negative results that don’t inform the story
Redundant visualizations

Discussion (1-2 pages)

Interpret your results and discuss implications.

Standard Structure

Main Findings (1 paragraph)
- Summarize key results
- Example: “Multimodal fusion improved AUROC by 6% over EHR-only models…”
Why It Works (1-2 paragraphs)
- Mechanism explanation
- Example: “We hypothesize that patient-reported symptoms capture subjective experiences missing from structured data…”
Limitations (1 paragraph)
- Be honest about weaknesses
- Show you understand your work critically
Future Work (1 paragraph)
- What’s next?
- Open questions
Broader Impact (1 paragraph)
- Clinical implications
- Societal considerations

Limitations Example

Be specific and honest:

“Our study is limited by a single-site dataset (N=50,000 from one hospital)”
“Evaluation on one hospital; generalization to other EDs unclear”
“Attention visualization shows correlation, not causation”
“No prospective clinical deployment; real-world impact unknown”

Note: Acknowledging limitations demonstrates critical thinking and research maturity. Reviewers appreciate honest limitations more than overstated claims.

Abstract (0.5 pages)

Write this LAST, after everything else is done.

Abstract Structure (250 words)

Context (1-2 sentences): Why does this matter?
Gap (1 sentence): What’s missing?
Method (2-3 sentences): What did you do?
Results (2-3 sentences): What did you find?
Impact (1 sentence): What does it mean?

Example Abstract

Emergency department (ED) overcrowding affects millions of patients annually, requiring accurate outcome prediction for resource allocation. Existing models rely solely on structured EHR data, ignoring patient-reported symptoms. We propose MultiModal-ED, a transformer architecture that fuses structured EHR sequences with free-text symptom descriptions and anatomical sketch drawings. Our model uses ETHOS for EHR encoding, ClinicalBERT for text, and ResNet-50 for sketches, combined via learned cross-modal attention. Evaluated on 50,000 ED visits, MultiModal-ED achieves 0.89 AUROC for mortality prediction, a 6% improvement over ETHOS (p < 0.001). Ablation studies show text contributes +4% and sketches +2%. Attention visualizations reveal the model focuses on symptom severity and anatomical location. This work demonstrates that multimodal patient-reported data significantly improves clinical prediction, with implications for ED triage and resource planning.

Conclusion (0.5 pages)

Concise summary of the paper.

Components

Problem: What did you address?
Approach: How did you tackle it?
Results: What did you achieve?
Impact: What does it mean for the field?

Keep it brief and forward-looking.

Writing Tips

General Guidelines

Active voice: “We propose…” not “A method is proposed…”
Present tense for facts: “Transformers use attention mechanisms”
Past tense for your work: “We trained a model…”
Be concise: Remove unnecessary words
Avoid jargon: Define technical terms
Parallel structure: Use consistent formatting

Common Mistakes

Overselling: “Revolutionary”, “groundbreaking” (let reviewers decide)
Underselling: “Slightly better” when results are strong
Vague claims: “Significantly better” (quantify it!)
Missing details: Can’t reproduce from your Methods
Burying results: Put key findings upfront

LaTeX Template


\documentclass{article}
\usepackage{neurips_2025}  % Or ICML, ICLR template
 
\title{Multimodal Transformers for Emergency Department Outcome Prediction}
\author{Your Name \and Supervisor Name}
 
\begin{document}
\maketitle
 
\begin{abstract}
Your abstract here...
\end{abstract}
 
\section{Introduction}
Emergency department overcrowding...
 
\section{Related Work}
Recent work has applied transformers...
 
\section{Methods}
\subsection{Data}
We use 50,000 ED visits...
 
\subsection{Model Architecture}
Our multimodal model consists of...
 
\section{Results}
Table 1 shows our main results...
 
\section{Discussion}
We find that multimodal fusion...
 
\section{Conclusion}
This work demonstrates...
 
\end{document}

Attention mechanism for model interpretability
Optimization for training details
Clinical interpretability for healthcare applications

Reading Papers - Understanding paper structure
Research Questions - Defining your contribution
Experimental Design - What to report in Results

Key Takeaways

IMRaD structure: Introduction → Methods → Results → Discussion
Reproducibility: Methods should enable replication
Clear contributions: State explicitly what you contributed
Honest limitations: Acknowledge weaknesses shows maturity
Multiple metrics: Report comprehensive evaluation
Visual clarity: Figures and tables should be self-contained
Write abstract last: After you know the full story

Structuring Research Papers

Introduction (1-1.5 pages)

Standard Structure

Writing Tips

Related Work (1-2 pages)

Organization by Theme

For Each Paper

Example Paragraph Structure

Methods (2-3 pages)

Essential Components

Methods Checklist

Results (2-3 pages)

Essential Components

Figure Guidelines

What to Include/Exclude

Discussion (1-2 pages)

Standard Structure

Limitations Example

Abstract (0.5 pages)

Abstract Structure (250 words)

Example Abstract

Conclusion (0.5 pages)

Components

Writing Tips

General Guidelines

Common Mistakes

LaTeX Template

Related Concepts

Related Guides

Key Takeaways