Product Identity
product_id / upc / ean
Platform-native product ID plus the universal UPC-A (12-digit) and EAN-13 barcodes, enabling precise cross-platform product matching. UPC-level matching is the foundation of every grocery data intelligence use case: it ensures you are comparing the exact same 32 oz Tropicana OJ across Instacart, Walmart, and Kroger rather than relying on name-matched approximations.
"upc": "742365004871"
Product Identity
name / brand / manufacturer
Full product name as listed, brand name normalised across platforms, and manufacturer where disclosed. Essential for brand-level aggregation: the same Horizon Organic milk appears under slightly different names on Instacart, Amazon Fresh, and Kroger.com, so names must be normalised before any cross-platform analysis.
"brand": "Horizon Organic"
Product Identity
category / subcategory / aisle
Full category path from the platform (e.g. Dairy > Milk > Organic Milk) mapped to our standardised US grocery taxonomy, plus the physical aisle descriptor where exposed. Category data enables shelf-space share analysis, competitive set definition, and cross-category demand intelligence at scale.
"category": "Dairy > Milk > Organic"
Product Identity
size / weight / unit / pack_size
Product size in the platform's displayed format (e.g. "1 gallon", "32 fl oz", "6-pack"), normalised weight or volume in standard units, and pack size for multipack products. Critical for unit-price comparison: without size normalisation, a price comparison between a 16 oz and a 32 oz product is meaningless.
"size": "1 gallon", "weight_oz": 128
Product Identity
images[] / thumbnail_url
All product image URLs in display order: main product shot, alternate angles, and nutrition label images where exposed. CPG data teams use images for planogram analysis, packaging change detection, and competitive creative benchmarking across retailer websites.
"image_count": 4
Product Identity
certifications / labels
Product certifications and label claims extracted from listings, such as USDA Organic, Non-GMO Project Verified, Kosher, Halal, Gluten-Free, Fair Trade, and Rainforest Alliance. These on-pack claims appear in platform listings and are increasingly decisive purchase factors tracked by CPG brand teams and retail buyers.
"labels": ["USDA Organic", "Non-GMO"]
Pricing
price_usd / sale_price_usd / was_price_usd
Current selling price, active sale price, and original "was" price, all in USD. Quick commerce grocery pricing is highly dynamic: platforms such as Gopuff and Instacart can adjust prices multiple times per day based on demand, inventory, and competitor signals. Point-in-time timestamped pricing is the core of every grocery data intelligence use case.
"price_usd": 7.49, "sale_price": 5.99
Pricing
unit_price / price_per_oz / price_per_lb
Calculated unit price in the platform's displayed format plus normalised price per ounce and price per pound where applicable. Unit price is the most common consumer-side comparison metric, and shelf unit pricing is regulated at the state level in the US, with a number of states requiring it. Tracking it across platforms is essential for price parity analytics.
"price_per_oz": "$0.047 / oz"
Pricing
loyalty_price_usd / loyalty_program
Loyalty card or membership price (e.g. Kroger Plus, Safeway Club, Target Circle) and the program name. Loyalty pricing in US grocery can be 15–30% below shelf price. It is invisible to standard price monitoring, yet it is the price most loyalty-enrolled consumers actually pay. This field exposes the real competitive price for most US grocery shoppers.
"loyalty_price": 4.99, "program": "Kroger Plus"
Pricing
digital_coupon_value / coupon_type
Active digital coupon face value in USD and coupon type (instant savings, manufacturer, load-to-card, clippable). Digital grocery coupons are a $3B+ annual market in the US. A product with a $1.50 digital coupon has a materially different competitive price position than its listed price suggests, yet most price monitoring tools do not capture this layer.
"coupon_value": 1.50, "type": "load-to-card"
Pricing
price_history[] / price_30d_low
Time-series price data going back up to 24 months, plus the lowest price observed in the trailing 30 days. Essential for US grocery inflation tracking, promotional cadence analysis (many CPG brands run a 4-week promotional cycle), and competitive price intelligence. Each price point is timestamped and tagged with the platform, retailer, and ZIP code, enabling hyper-local price trend analysis.
"price_30d_low": 4.99
Pricing
promo_type / promo_end_date / bogo_flag
Active promotional mechanic type (BOGO, buy 2 save $1, % off, dollar off, mix & match), promotion end date, and boolean BOGO flag. US grocery promotions are structurally complex: a BOGO (buy one, get one free) deal gives an effective unit price 50% below the listed price when two units are bought. Capturing promotional mechanics is essential for true price intelligence in the CPG channel.
"promo_type": "BOGO", "end_date": "2025-06-08"
Availability
in_stock / stock_level / oos_since
Boolean in-stock flag, qualitative stock level (In Stock / Low Stock / Out of Stock), and the timestamp when the product first went out of stock (OOS). OOS detection is one of the most valuable quick commerce data signals: a competitor going OOS on a key SKU opens a demand capture window that may last only hours before they restock.
"in_stock": true, "stock_level": "Low"
Availability
zip_code / store_id / retailer_name
The specific ZIP code, store ID, and retailer name for which this price and availability record was collected. US grocery pricing is hyper-local: a gallon of Horizon Organic milk can differ by $1.50 between two ZIP codes in the same city served by the same chain. Every record is tied to a specific geographic location.
"zip_code": "10001", "retailer": "Whole Foods"
Availability
pickup_available / curbside_available
Whether the product is available for in-store pickup and curbside collection at the specific store location. BOPIS (Buy Online, Pick Up In Store) and curbside pickup have become critical omnichannel fulfillment methods post-pandemic, so tracking their availability and product assortment coverage is essential for omnichannel retail analytics.
"pickup_available": true
Delivery Intelligence
delivery_window_min / delivery_fee / free_delivery_threshold
Minimum delivery window in minutes (critical for quick commerce: Gopuff promises 30 minutes, Instacart 1 hour), delivery fee in USD, and the cart value threshold for free delivery. Together, these three fields define what delivery costs the consumer in time and money for any given order, which is essential for competitive positioning analysis.
"delivery_min": 30, "fee": 3.99
Delivery Intelligence
delivery_slots[] / next_available_slot
Available future delivery time slots and the next available slot timestamp. Delivery slot availability is a direct proxy for local demand and platform capacity: when Instacart or Amazon Fresh slots fill up 48 hours in advance, it signals a demand spike that affects product availability and fulfillment reliability in that market.
"next_slot": "2025-06-04T14:00:00"
Delivery Intelligence
platform_fee / service_fee / tip_suggested
Platform service fee, any per-order platform fee, and the suggested tip amount. The true consumer cost of a quick commerce grocery order includes multiple fee layers beyond product price, and platforms frequently adjust these to manage demand. Tracking the total order economics is essential for consumer price perception research and competitive platform analysis.
"service_fee": 1.99, "platform_fee": 0.99
Nutrition & Ingredients
calories / macros / serving_size
Calories per serving, macronutrient breakdown (protein, fat, carbohydrates, fiber, sugar), and serving size as displayed on the nutrition label. Extracted from platform nutrition panels and structured into a consistent schema, enabling nutritional benchmarking, health claim validation, and dietary compliance analysis across the full product catalog.
"calories": 150, "protein_g": 8
Nutrition & Ingredients
ingredients[] / allergens[]
Full ingredient list and allergen declarations (the 9 major FDA-recognised allergens: milk, eggs, fish, crustacean shellfish, tree nuts, peanuts, wheat, soybeans, sesame) extracted from listing pages. Powers allergen detection tools, ingredient transparency apps, reformulation tracking, and FDA label compliance monitoring for food brands and retailers.
"allergens": ["milk", "soy"]
Ratings & Reviews
rating / review_count / verified_purchases
Aggregate product rating (1–5 stars), total review count, and verified purchase count where available. US grocery consumers rely heavily on ratings for new product discovery, particularly on Instacart and Amazon Fresh, where ratings directly influence search ranking and recommendation placement. That makes this a key shelf visibility signal.
"rating": 4.8, "review_count": 2841
Ratings & Reviews
top_reviews[] / review_keywords
Top 10 most helpful consumer reviews with text, rating, date, and verified flag, plus AI-extracted keyword themes. Grocery reviews surface quality issues, taste profile feedback, and packaging problems that brand teams need for product iteration: "tastes watery", "packaging leaks", and "smaller than shown" are quality signals with no equivalent in sales data.
"keywords": ["fresh", "great value", "easy open"]
Basket Analytics
basket_frequency / basket_penetration
Measures how often a product appears in customer baskets and the percentage of orders containing the item. Used to identify staple products, basket drivers, and category penetration across grocery retailers.
"basket_penetration_pct": 18.4
Freshness & Shelf Life
expiration_date / shelf_life_days
Product expiration date and estimated shelf life remaining at the time of collection. Critical for perishables including dairy, produce, meat, and ready-to-eat grocery products.
"expiration_date": "2025-06-18", "shelf_life_days": 12
Demand Signals
popularity_score / trending_rank
Product popularity score and category trend ranking based on search activity, purchases, and shopper engagement. Helps identify fast-growing products and emerging consumer preferences.
"popularity_score": 92, "trending_rank": 7