
You can think of a return reason taxonomy as the diagnostic chart for your returns. Instead of a vague “sizing issue,” it captures the precise why—“too small at chest,” “inseam too long,” “fabric less stretchy than expected”—in a consistent, machine-readable way across every return touchpoint.
According to the 2025 Narvar analysis, size and fit were cited by 42% of consumers as the reason for their last return, underscoring why granular fit data matters so much in apparel (“42% cited size and fit issues” — Narvar, 2025). At the same time, Baymard’s UX research highlights a root cause: many product pages still don’t equip shoppers to choose correctly; in 2024 Baymard reported that 83% of apparel sites lack sufficient sizing information (“83% insufficient sizing info — Baymard 2024 Year in Review”). And the stakes are high: U.S. retail returns reached roughly $890B in 2024, per the National Retail Federation’s joint reporting (NRF and Happy Returns, 2024 returns total $890B).
What a Return Reason Taxonomy Is (and Isn’t)
- Core definition: A structured, hierarchical classification that standardizes why products are returned. In apparel, it prioritizes sizing and fit granularity to power analytics, product improvements, PDP content, and policy decisions.
- Boundaries: It isn’t a size chart, a returns policy, or a product taxonomy. It labels the cause of a return so teams can measure, learn, and prevent repeats.
Why it matters now
- Fit is the leading consumer-stated reason in 2025 apparel returns, so measurement needs to be specific enough to be actionable (see Narvar 2025 reference above).
- UX gaps persist on PDPs (Baymard 2024), making it critical to close the loop from returns data back to content, size guides, and recommendations.
- There’s no single, industry-mandated “reason code” standard (NRF/GS1 don’t prescribe one), so retailers should adopt their own machine-readable taxonomy linked to global product identifiers for interoperability (GS1 Digital Link Standard, 2024; GS1 CBV Standard 2.0).
A Practical Apparel Taxonomy (2025)
Tier 1 categories (parent buckets)
- Fit & Sizing
- Product Description / Expectation Gap
- Quality / Defect / Damage
- Shipping / Logistics Issues
- Changed Mind / No Longer Needed
- Ordering Error
- Price / Promotion Issues
- Other (with optional free text)
Fit & Sizing — Tier 2 and Tier 3 examples
- Directional fit issues
- Too small → body area: chest/bust, waist, hip, shoulder, sleeve, inseam, thigh/calf, overall
- Too large → same body-area set
- Too short / Too long → torso length, sleeve length, inseam, rise
- Style and fabric-related
- Fit style mismatch → slim vs. regular vs. relaxed/oversized
- Fabric behavior → less stretchy than expected; drape/stiffness causes poor fit
- Consistency and guidance
- Size inconsistency → deviates from brand’s size chart; cross-brand conversion off
- Measurement confusion → unclear chart, body vs. garment measurement, units
- Footwear (if applicable)
- Length, width, instep, volume
Example code mapping (machine-readable + human-friendly)
Parent | Child | Body area | Human label | Machine code |
---|---|---|---|---|
Fit & Sizing | Too small | Chest/Bust | Too small at chest/bust | FIT_TOO_SMALL_CHEST |
Fit & Sizing | Too long | Inseam | Inseam too long | FIT_TOO_LONG_INSEAM |
Fit & Sizing | Fit style mismatch | — | Slim fit felt too tight | FIT_STYLE_MISMATCH_SLIM |
Fit & Sizing | Fabric behavior | — | Less stretchy than expected | FIT_FABRIC_LESS_STRETCH |
Description gap | Color | — | Color looked different in person | DESC_COLOR_OFF |
Quality/Defect | Zipper | — | Zipper failure | QUAL_ZIPPER_FAILURE |
Shipping | Damaged in transit | — | Damaged packaging/items | SHIP_DAMAGED_IN_TRANSIT |
Ordering error | Wrong size ordered | — | Ordered wrong size | ORDER_WRONG_SIZE |
Other | — | — | Other (describe) | OTHER_FREE_TEXT |
Implementation playbook: low-friction capture, high-quality data
-
Customer-facing flow (web/app)
- Progressive disclosure: Keep it to 2–3 taps. Example micro-flow for bottoms: “Fit & Sizing” → “Too long” → “Inseam.” Offer an optional text box (“Tell us more: e.g., 2 inches too long”).
- Exchange-first: Promote exchanges when a fit reason is selected to preserve revenue and improve outcome for the shopper. Benchmarks and vendor practices across 2024–2025 often highlight exchange-first as a win for both sides (see industry context via NRF above and platform ecosystems).
- Context rules: Only show inseam/rise options for bottoms; show chest/shoulder for tops. This reduces errors and effort.
-
In-store and CS capture
- Quick-pick lists with the same taxonomy. Train associates to probe for the minimal extra detail (“waist vs. hip?”) without slowing the line.
- Scan barcode/GTIN/UPC to auto-link the exact variant, improving attribution to size runs and vendors (align IDs to GS1 guidance like Digital Link and CBV for consistency; see GS1 references above).
-
Systems integration
- Shopify: Many merchants capture reason codes via RMA apps and store them as metadata because the Admin API focuses on return objects and workflows rather than prescribing reason fields; see the current Admin GraphQL documentation for returns flows like returnCreate and release notes for 2024–2025 changes (Shopify Admin GraphQL: returnCreate; Shopify Admin API release notes 2024-10).
- BigCommerce: Use returns/refunds workflows plus app/custom fields; the Orders v3 API exposes refunds endpoints that you can join with reason metadata captured elsewhere (BigCommerce Orders v3 — Refunds).
- Adobe Commerce (Magento): Configure RMA reasons, conditions, and resolutions natively in Admin; extend with custom attributes where needed (Adobe Commerce: Returns (RMA); Returns attributes).
- Data flow: Push reason_code with order/item identifiers into OMS/WMS and your analytics stack via webhooks or ETL. Version your taxonomy so historical analyses remain valid.
Data model and governance
- Recommended fields
- reason_code (string), reason_label, parent_category, fit_direction (too small/too large/too short/too long), body_area, garment_type, fabric_stretch (low/med/high), channel (web/app/store/CS), return_type (refund/exchange), exchange_to_size, order_date, return_date, days_to_return, taxonomy_version.
- Controlled vocabulary + optional free text
- Use machine codes like FIT_TOO_SMALL_CHEST. Permit a short optional note; mine it with NLP quarterly to promote recurring phrases into new controlled codes.
- Validation and versioning
- Context-aware rules (e.g., no inseam for tops). Maintain taxonomy_version with active_from and deprecated_on for clean longitudinal analysis.
- Interoperability
- Map SKUs/variants to GTIN/UPC and maintain product identity consistently using GS1 best practices (GS1 Digital Link, 2024; GS1 CBV 2.0).
Using the data: from diagnostics to prevention
- PDP enhancements
- Add garment measurement tables with clear unit toggles and explain body vs. garment measurements. Baymard’s research outlines UX patterns that help shoppers choose correctly, including using an aggregate fit subscore in reviews to set expectations (Baymard on aggregate fit subscores).
- Highlight model measurements and size worn; enable UGC filters by body type to reduce guesswork.
- Merchandising and product development
- Identify body-area hotspots (e.g., hips in a particular denim style) and adjust grading, tolerances, or pattern blocks. Track by vendor and season.
- Sizing and AI fit tools
- Feed clean reason_code data into fit-prediction models or PDP guidance. High-quality, body-area-labeled data improves recommendations compared to generic “sizing issue” labels.
- Policy tuning
- Encourage exchanges for Fit & Sizing while maintaining stricter rules or fees for non-fit discretionary returns to curb abuse—an approach commonly reported in 2024–2025 retail operations commentary alongside overall returns context (NRF 2024 macro returns context).
Metrics that matter (and practical targets)
- Primary metrics
- Return Rate (RR), Fit-Related Return Rate (FRR), Exchange Rate, Size-Exchange Success Rate, Days-to-Return.
- Diagnostic metrics
- Top body-area mismatches, size-run outliers, brand-to-chart deviation, fabric stretch miss rate, “Other%,” reason completeness.
- Benchmarks and goals
- Use external context carefully: overall U.S. return rate was 16.9% in 2024 per NRF; treat this as directional, not a target for apparel specifically (NRF 2024 returns total and rate).
- Set an internal goal to reduce FRR by 10–20% over one to two seasons through PDP and sizing improvements, merchandising changes, and exchange-first flows.
Common pitfalls (and how to avoid them)
- Over-granularity that adds friction → Use progressive branching and keep to 2–3 taps.
- Inconsistent capture across channels → Make parent + child codes mandatory and train staff.
- Free-text sprawl → Mine notes and fold recurrent themes into new codes quarterly.
- Skewed insights due to underrepresented shopper cohorts → Monitor FRR by cohort (e.g., petite vs. tall) and validate that fixes help each group.
Quick-start checklist
- Define Tier 1–3 with emphasis on Fit & Sizing; create machine codes (e.g., FIT_TOO_LONG_INSEAM).
- Configure return flows with progressive disclosure; enable exchange-first for Fit & Sizing.
- Implement validation rules (garment-type-aware body areas).
- Pipe reason_code and taxonomy_version to your OMS/WMS and BI via webhook/ETL.
- Build BI views segmented by body area, garment type, fabric stretch, vendor, cohort.
- Monitor “Other%” and “Unspecified%”; trigger taxonomy review if either exceeds 10%.
- Close the loop on PDP (size guides, measurements), merchandising (grading/tolerance), and policy.
Appendix: compact taxonomy table (starter)
Tier 1 | Tier 2 | Tier 3 (optional) | Code example |
---|---|---|---|
Fit & Sizing | Too small | Chest/Bust | FIT_TOO_SMALL_CHEST |
Fit & Sizing | Too large | Waist | FIT_TOO_LARGE_WAIST |
Fit & Sizing | Too short | Sleeve | FIT_TOO_SHORT_SLEEVE |
Fit & Sizing | Too long | Rise | FIT_TOO_LONG_RISE |
Fit & Sizing | Style mismatch | Slim vs. regular | FIT_STYLE_MISMATCH_SLIM |
Fit & Sizing | Fabric behavior | Less stretchy than expected | FIT_FABRIC_LESS_STRETCH |
Description gap | Color off | — | DESC_COLOR_OFF |
Quality/Defect | Stitching | Seam unravel | QUAL_STITCH_SEAM |
Shipping | Damaged in transit | — | SHIP_DAMAGED_IN_TRANSIT |
Ordering error | Wrong size ordered | — | ORDER_WRONG_SIZE |
Other | — | Free text | OTHER_FREE_TEXT |
References and documentation
- Size/fit as leading reason in 2025 consumer returns: “42% cited size and fit issues” — Narvar, 2025
- UX gaps driving fit mistakes (2024): Baymard 2024 Year in Review: 83% insufficient sizing info and Baymard guidance on aggregate fit subscores
- Macro scale of returns (2024): NRF and Happy Returns: retail returns total $890B
- Platform and standards context: Shopify Admin GraphQL: returnCreate; Shopify Admin API release notes 2024-10; BigCommerce Orders v3 — Refunds; Adobe Commerce RMA; GS1 Digital Link (2024); GS1 CBV 2.0