Flight
flight_number / carrier_code
IATA airline code and flight number scraped in real-time
from Indian airline websites and OTA aggregators. Covers IndiGo, Air India, SpiceJet, Vistara (now Air
India), Akasa Air, and all operating Indian carriers across 100+ domestic routes.
"6E2041" / "6E" (IndiGo)
Flight
origin / destination
IATA airport codes and city names for departure and
arrival, normalized across all Indian airports including metro hubs (DEL, BOM, BLR, MAA, CCU, HYD) and
Tier-2 destinations like Indore, Coimbatore, and Guwahati.
{"origin": "DEL", "destination": "BOM"}
Flight
departure_datetime / arrival_datetime
Scheduled and actual departure/arrival timestamps in IST
(UTC+5:30) and UTC. Includes real-time delay status, tarmac hold flags, gate changes, and cancellation
codes — essential for Indian aviation intelligence platforms.
"2025-06-15T06:30:00+05:30"
Flight
fare_class / cabin
Booking class code and cabin type (economy, premium
economy, business) with seat availability counts. Tracks IndiGo's Super Saver vs. Flexi fares, Air
India's business class yield, and budget airline restricted fare conditions side-by-side.
{"class": "Y", "cabin": "economy", "seats_left": 6}
Flight
stops / layover_airports
Number of stops and intermediate Indian airport codes
with layover durations. Common via-points like Hyderabad (HYD) and Bangalore (BLR) for Northeast routes
are fully mapped, enabling nonstop-only filters and true journey-time comparisons.
{"stops": 1, "layovers": [{"airport": "HYD", "min":
75}]}
Flight
baggage_policy
Carry-on and checked bag allowances with fees per
booking class scraped from all Indian carrier policy pages. Especially critical for low-cost carrier
(LCC) comparison as IndiGo, SpiceJet, and Akasa Air charge separately for baggage add-ons.
{"carry_on": "7kg_free", "checked_15kg": 699}
Hotel
hotel_name / property_type
Full property name and classification covering India's
unique accommodation spectrum — heritage hotels, palace hotels, dharamshalas, homestays, resorts,
service apartments, and budget lodges — normalized across MakeMyTrip, Goibibo, and Booking.com.
"Umaid Bhawan Palace" / "heritage_palace"
Hotel
star_rating / guest_score
Official star classification (Ministry of Tourism
approved) and platform-specific guest review score on a normalized 0–10 scale. Bridges inconsistencies
across MakeMyTrip's "wallet points" system, Goibibo's Go Cash scores, and Booking.com's 10-point scale.
{"stars": 5, "guest_score": 9.1}
Hotel
amenities
Standardized list of 65+ amenity flags including AC
type, rooftop, restaurant cuisine type, in-house travel desk, airport pickup, yoga room, Ayurveda spa,
and wedding/banquet facilities — scraped and mapped from unstructured Indian property pages.
["pool", "ayurveda_spa", "airport_pickup",
"wedding_hall"]
Hotel
room_types / max_occupancy
All available room configurations with Indian-specific
naming (Superior, Deluxe, Club, Presidential) plus bed type, AC classification, view type, and max
occupancy. Paired with per-room pricing across all listed OTAs for full comparison coverage.
{"type": "deluxe_sea_view", "sqft": 480, "max_occ":
3}
Hotel
meal_plan
India-specific meal plan classification — EP (Room
Only), CP (Breakfast), MAP (Breakfast + Dinner), AP (All Meals) — scraped per rate and OTA. Meal plan
inclusions significantly affect true price comparisons in the Indian market.
"MAP" (Breakfast + Dinner Included)
Hotel
availability_calendar
30/60/90-day rolling availability view with
open/sold-out dates and surge pricing across peak Indian travel seasons — Diwali, Dussehra, Christmas,
and IPL season — where hotel rates spike 3–5x versus off-peak.
{"2025-10-20": {"available": false, "note": "Diwali
peak"}}
Pricing
price_per_night_inr
Current base rate per night in INR with GST-inclusive
and GST-exclusive variants (Indian hotels display rates both ways). Tracked with timestamps for
real-time rate parity monitoring across all Indian OTA channels.
{"base": 18500, "gst_inclusive": 21830}
Pricing
ota_price_comparison
Same-property pricing captured simultaneously across
MakeMyTrip, Goibibo, Cleartrip, Ixigo, Yatra, Agoda, and Booking.com. Price spreads of 8–25% are common
in India — this field surfaces arbitrage and parity violations at scale.
{"makemytrip": 18500, "goibibo": 18200, "cleartrip":
18950}
Pricing
gst_breakdown
Itemized GST slab applied to the booking — 12% for
hotels below ₹7,500/night, 18% above — plus any other mandatory state levies. Critical for corporate
travel reimbursement platforms and GST input credit reconciliation tools.
{"gst_slab": "18%", "gst_amount": 3330, "total_inr":
21830}
Pricing
price_history
Historical rate snapshots timestamped up to 24 months in
INR. Track seasonal pricing spikes around Indian festivals, long weekends, and IPL season; model advance
purchase discount curves for any property across India's 28 states and 8 UTs.
[{"date":"2025-01","price_inr":12400},
{"date":"2025-10","price_inr":41000}]
OTA
ota_listing_url
Direct listing URL on each Indian OTA platform including
MakeMyTrip, Goibibo, Cleartrip, Ixigo, Yatra, EaseMyTrip, and Agoda — enabling deep-link integration
into comparison tools and price alert platforms for the Indian travel market.
"https://www.makemytrip.com/hotels/hotel_detail/?hotelId=..."
OTA
coupon_codes / wallet_offers
Platform-specific promo codes, HDFC/ICICI/SBI bank card
offers, Go Cash wallet balances, and MakeMyTrip double dip deals scraped as they appear — with expiry
timestamps, eligible payment methods, and effective discount percentages.
{"code": "HDFCFLY", "discount_pct": 12, "bank":
"HDFC"}
Train /
IRCTC
train_number / train_name
Indian Railways train number and official name scraped
from IRCTC and third-party train tracking platforms. Covers all express, superfast, Vande Bharat,
Rajdhani, Shatabdi, and Duronto services across India's 68,000+ km rail network.
{"number": "12301", "name": "Howrah Rajdhani"}
Train /
IRCTC
class_availability / quota
Seat and berth availability by class (1A, 2A, 3A, SL,
CC, EC, 3E) and quota type (general, tourist, tatkal, premium tatkal, ladies, senior citizen) — updated
every 15 minutes to mirror IRCTC's live seat inventory.
{"2A": {"avail": 12, "quota": "GN"}, "SL": {"avail":
0, "WL": 45}}
Train /
IRCTC
tatkal_fare / dynamic_price
Tatkal and premium tatkal surcharge amounts by class,
plus current dynamic pricing multiplier. Dynamic pricing on Rajdhani and Shatabdi trains can push fares
1.5–2x base — this field tracks the real-time multiplier for fare alert and booking optimization tools.
{"base_fare": 1240, "tatkal_surcharge": 400,
"dynamic_mult": 1.4}
Reviews
review_sentiment_score
Aggregated sentiment score from structured parsing of
English and Hindi guest review text. Subscore breakdowns by cleanliness, location, value, service, food
quality, and staff behavior — enabling NLP-ready Indian travel datasets for AI model training.
{"overall": 9.1, "food": 9.4, "cleanliness": 8.9}
Location
geo_coordinates
Verified latitude/longitude with sub-50m accuracy across
India's 740+ districts. Includes Plus Codes and Gram Panchayat-level identifiers for rural properties in
pilgrimage destinations like Varanasi, Tirupati, and Kedarnath.
{"lat": 19.0449, "lng": 72.8186}
Location
address / pincode / state
Full standardized Indian address including street,
locality, city, district, state, and 6-digit PIN code — normalized to India Post standards. Covers all
28 states, 8 Union Territories, and hill station addresses where conventional addressing is
inconsistent.
"Bandstand, Bandra West, Mumbai, MH 400050"
Location
nearby_attractions / poi_distance
Distance in km to top POIs including airports, railway
stations, pilgrimage sites, national parks, metro stations, and tech parks. Validated across India's
4,000+ cities for proximity filtering in Indian consumer travel apps.
{"airport_km": 14.2, "gateway_of_india_km": 0.8}