Return Reason Taxonomy (Apparel Sizing & Fit Analytics)

3 September 2025 by

WarpDriven

Cover — Image Source: statics.mylandingpages.co

You can think of a return reason taxonomy as the diagnostic chart for your returns. Instead of a vague “sizing issue,” it captures the precise why—“too small at chest,” “inseam too long,” “fabric less stretchy than expected”—in a consistent, machine-readable way across every return touchpoint.

According to the 2025 Narvar analysis, size and fit were cited by 42% of consumers as the reason for their last return, underscoring why granular fit data matters so much in apparel (“42% cited size and fit issues” — Narvar, 2025). At the same time, Baymard’s UX research highlights a root cause: many product pages still don’t equip shoppers to choose correctly; in 2024 Baymard reported that 83% of apparel sites lack sufficient sizing information (“83% insufficient sizing info — Baymard 2024 Year in Review”). And the stakes are high: U.S. retail returns reached roughly $890B in 2024, per the National Retail Federation’s joint reporting (NRF and Happy Returns, 2024 returns total $890B).

What a Return Reason Taxonomy Is (and Isn’t)

Core definition: A structured, hierarchical classification that standardizes why products are returned. In apparel, it prioritizes sizing and fit granularity to power analytics, product improvements, PDP content, and policy decisions.
Boundaries: It isn’t a size chart, a returns policy, or a product taxonomy. It labels the cause of a return so teams can measure, learn, and prevent repeats.

Why it matters now

Fit is the leading consumer-stated reason in 2025 apparel returns, so measurement needs to be specific enough to be actionable (see Narvar 2025 reference above).
UX gaps persist on PDPs (Baymard 2024), making it critical to close the loop from returns data back to content, size guides, and recommendations.
There’s no single, industry-mandated “reason code” standard (NRF/GS1 don’t prescribe one), so retailers should adopt their own machine-readable taxonomy linked to global product identifiers for interoperability (GS1 Digital Link Standard, 2024; GS1 CBV Standard 2.0).

A Practical Apparel Taxonomy (2025)

Tier 1 categories (parent buckets)

Fit & Sizing
Product Description / Expectation Gap
Quality / Defect / Damage
Shipping / Logistics Issues
Changed Mind / No Longer Needed
Ordering Error
Price / Promotion Issues
Other (with optional free text)

Fit & Sizing — Tier 2 and Tier 3 examples

Directional fit issues
- Too small → body area: chest/bust, waist, hip, shoulder, sleeve, inseam, thigh/calf, overall
- Too large → same body-area set
- Too short / Too long → torso length, sleeve length, inseam, rise
Style and fabric-related
- Fit style mismatch → slim vs. regular vs. relaxed/oversized
- Fabric behavior → less stretchy than expected; drape/stiffness causes poor fit
Consistency and guidance
- Size inconsistency → deviates from brand’s size chart; cross-brand conversion off
- Measurement confusion → unclear chart, body vs. garment measurement, units
Footwear (if applicable)
- Length, width, instep, volume

Example code mapping (machine-readable + human-friendly)

Parent	Child	Body area	Human label	Machine code
Fit & Sizing	Too small	Chest/Bust	Too small at chest/bust	FIT_TOO_SMALL_CHEST
Fit & Sizing	Too long	Inseam	Inseam too long	FIT_TOO_LONG_INSEAM
Fit & Sizing	Fit style mismatch	—	Slim fit felt too tight	FIT_STYLE_MISMATCH_SLIM
Fit & Sizing	Fabric behavior	—	Less stretchy than expected	FIT_FABRIC_LESS_STRETCH
Description gap	Color	—	Color looked different in person	DESC_COLOR_OFF
Quality/Defect	Zipper	—	Zipper failure	QUAL_ZIPPER_FAILURE
Shipping	Damaged in transit	—	Damaged packaging/items	SHIP_DAMAGED_IN_TRANSIT
Ordering error	Wrong size ordered	—	Ordered wrong size	ORDER_WRONG_SIZE
Other	—	—	Other (describe)	OTHER_FREE_TEXT

Implementation playbook: low-friction capture, high-quality data

Customer-facing flow (web/app)
- Progressive disclosure: Keep it to 2–3 taps. Example micro-flow for bottoms: “Fit & Sizing” → “Too long” → “Inseam.” Offer an optional text box (“Tell us more: e.g., 2 inches too long”).
- Exchange-first: Promote exchanges when a fit reason is selected to preserve revenue and improve outcome for the shopper. Benchmarks and vendor practices across 2024–2025 often highlight exchange-first as a win for both sides (see industry context via NRF above and platform ecosystems).
- Context rules: Only show inseam/rise options for bottoms; show chest/shoulder for tops. This reduces errors and effort.
In-store and CS capture
- Quick-pick lists with the same taxonomy. Train associates to probe for the minimal extra detail (“waist vs. hip?”) without slowing the line.
- Scan barcode/GTIN/UPC to auto-link the exact variant, improving attribution to size runs and vendors (align IDs to GS1 guidance like Digital Link and CBV for consistency; see GS1 references above).
Systems integration
- Shopify: Many merchants capture reason codes via RMA apps and store them as metadata because the Admin API focuses on return objects and workflows rather than prescribing reason fields; see the current Admin GraphQL documentation for returns flows like returnCreate and release notes for 2024–2025 changes (Shopify Admin GraphQL: returnCreate; Shopify Admin API release notes 2024-10).
- BigCommerce: Use returns/refunds workflows plus app/custom fields; the Orders v3 API exposes refunds endpoints that you can join with reason metadata captured elsewhere (BigCommerce Orders v3 — Refunds).
- Adobe Commerce (Magento): Configure RMA reasons, conditions, and resolutions natively in Admin; extend with custom attributes where needed (Adobe Commerce: Returns (RMA); Returns attributes).
- Data flow: Push reason_code with order/item identifiers into OMS/WMS and your analytics stack via webhooks or ETL. Version your taxonomy so historical analyses remain valid.

Data model and governance

Recommended fields
- reason_code (string), reason_label, parent_category, fit_direction (too small/too large/too short/too long), body_area, garment_type, fabric_stretch (low/med/high), channel (web/app/store/CS), return_type (refund/exchange), exchange_to_size, order_date, return_date, days_to_return, taxonomy_version.
Controlled vocabulary + optional free text
- Use machine codes like FIT_TOO_SMALL_CHEST. Permit a short optional note; mine it with NLP quarterly to promote recurring phrases into new controlled codes.
Validation and versioning
- Context-aware rules (e.g., no inseam for tops). Maintain taxonomy_version with active_from and deprecated_on for clean longitudinal analysis.
Interoperability
- Map SKUs/variants to GTIN/UPC and maintain product identity consistently using GS1 best practices (GS1 Digital Link, 2024; GS1 CBV 2.0).

Using the data: from diagnostics to prevention

PDP enhancements
- Add garment measurement tables with clear unit toggles and explain body vs. garment measurements. Baymard’s research outlines UX patterns that help shoppers choose correctly, including using an aggregate fit subscore in reviews to set expectations (Baymard on aggregate fit subscores).
- Highlight model measurements and size worn; enable UGC filters by body type to reduce guesswork.
Merchandising and product development
- Identify body-area hotspots (e.g., hips in a particular denim style) and adjust grading, tolerances, or pattern blocks. Track by vendor and season.
Sizing and AI fit tools
- Feed clean reason_code data into fit-prediction models or PDP guidance. High-quality, body-area-labeled data improves recommendations compared to generic “sizing issue” labels.
Policy tuning
- Encourage exchanges for Fit & Sizing while maintaining stricter rules or fees for non-fit discretionary returns to curb abuse—an approach commonly reported in 2024–2025 retail operations commentary alongside overall returns context (NRF 2024 macro returns context).

Metrics that matter (and practical targets)

Primary metrics
- Return Rate (RR), Fit-Related Return Rate (FRR), Exchange Rate, Size-Exchange Success Rate, Days-to-Return.
Diagnostic metrics
- Top body-area mismatches, size-run outliers, brand-to-chart deviation, fabric stretch miss rate, “Other%,” reason completeness.
Benchmarks and goals
- Use external context carefully: overall U.S. return rate was 16.9% in 2024 per NRF; treat this as directional, not a target for apparel specifically (NRF 2024 returns total and rate).
- Set an internal goal to reduce FRR by 10–20% over one to two seasons through PDP and sizing improvements, merchandising changes, and exchange-first flows.

Common pitfalls (and how to avoid them)

Over-granularity that adds friction → Use progressive branching and keep to 2–3 taps.
Inconsistent capture across channels → Make parent + child codes mandatory and train staff.
Free-text sprawl → Mine notes and fold recurrent themes into new codes quarterly.
Skewed insights due to underrepresented shopper cohorts → Monitor FRR by cohort (e.g., petite vs. tall) and validate that fixes help each group.

Quick-start checklist

Define Tier 1–3 with emphasis on Fit & Sizing; create machine codes (e.g., FIT_TOO_LONG_INSEAM).
Configure return flows with progressive disclosure; enable exchange-first for Fit & Sizing.
Implement validation rules (garment-type-aware body areas).
Pipe reason_code and taxonomy_version to your OMS/WMS and BI via webhook/ETL.
Build BI views segmented by body area, garment type, fabric stretch, vendor, cohort.
Monitor “Other%” and “Unspecified%”; trigger taxonomy review if either exceeds 10%.
Close the loop on PDP (size guides, measurements), merchandising (grading/tolerance), and policy.

Appendix: compact taxonomy table (starter)

Tier 1	Tier 2	Tier 3 (optional)	Code example
Fit & Sizing	Too small	Chest/Bust	FIT_TOO_SMALL_CHEST
Fit & Sizing	Too large	Waist	FIT_TOO_LARGE_WAIST
Fit & Sizing	Too short	Sleeve	FIT_TOO_SHORT_SLEEVE
Fit & Sizing	Too long	Rise	FIT_TOO_LONG_RISE
Fit & Sizing	Style mismatch	Slim vs. regular	FIT_STYLE_MISMATCH_SLIM
Fit & Sizing	Fabric behavior	Less stretchy than expected	FIT_FABRIC_LESS_STRETCH
Description gap	Color off	—	DESC_COLOR_OFF
Quality/Defect	Stitching	Seam unravel	QUAL_STITCH_SEAM
Shipping	Damaged in transit	—	SHIP_DAMAGED_IN_TRANSIT
Ordering error	Wrong size ordered	—	ORDER_WRONG_SIZE
Other	—	Free text	OTHER_FREE_TEXT

References and documentation

Size/fit as leading reason in 2025 consumer returns: “42% cited size and fit issues” — Narvar, 2025
UX gaps driving fit mistakes (2024): Baymard 2024 Year in Review: 83% insufficient sizing info and Baymard guidance on aggregate fit subscores
Macro scale of returns (2024): NRF and Happy Returns: retail returns total $890B
Platform and standards context: Shopify Admin GraphQL: returnCreate; Shopify Admin API release notes 2024-10; BigCommerce Orders v3 — Refunds; Adobe Commerce RMA; GS1 Digital Link (2024); GS1 CBV 2.0

in Industry

WarpDriven 3 September 2025

Return Reason Taxonomy (Apparel Sizing & Fit Analytics)

Share this post

Tags

Our blogs

Archive