Flight
flight_number / carrier_code
IATA airline code and flight number scraped in real time
from airline websites and GDS aggregators. Critical for schedule monitoring, delay alerting, and
competitive fare analysis tools.
"AA1234" / "AA"
Flight
origin / destination
IATA airport codes and full city names for departure and
arrival. Normalized to handle multi-leg routes, codeshare variations, and alternate airport groupings
(e.g. NYC area airports).
{"origin": "LAX", "destination": "JFK"}
Flight
departure_datetime / arrival_datetime
Scheduled and actual departure/arrival timestamps in UTC
and local timezone. Includes real-time delay status, gate changes, and cancellation flags for live
flight intelligence pipelines.
"2025-06-15T08:30:00-07:00"
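As a minimal sketch of how a consumer might convert the local-offset timestamp in the example above to UTC. The helper name `to_utc` is hypothetical and not part of the dataset; it only assumes the value is an ISO 8601 string carrying a UTC offset.

```python
from datetime import datetime, timezone

def to_utc(iso_timestamp: str) -> datetime:
    """Parse an ISO 8601 timestamp with a UTC offset and convert it to UTC."""
    local = datetime.fromisoformat(iso_timestamp)
    if local.tzinfo is None:
        # Without an offset the instant is ambiguous, so refuse to guess.
        raise ValueError("timestamp has no UTC offset")
    return local.astimezone(timezone.utc)

departure_utc = to_utc("2025-06-15T08:30:00-07:00")
print(departure_utc.isoformat())  # 2025-06-15T15:30:00+00:00
```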
Flight
fare_class / cabin
Booking class code (Y, B, M, Q…) and cabin type
(economy, premium economy, business, first) with seat availability counts — enabling yield management
and seat inventory monitoring.
{"class": "Y", "cabin": "economy", "seats_left": 4}
Flight
stops / layover_airports
Number of stops and intermediate airport codes with
layover duration in minutes. Used to calculate true travel time, filter nonstop-only results, and power
itinerary comparison engines.
{"stops": 1, "layovers": [{"airport": "ORD", "duration_min": 90}]}
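The nonstop filter and layover total described for this field could be sketched as follows. The function names are hypothetical, and the record shape is assumed to match the example above.

```python
import json

def is_nonstop(itinerary: dict) -> bool:
    """True when the itinerary reports zero stops."""
    return itinerary.get("stops", 0) == 0

def total_layover_minutes(itinerary: dict) -> int:
    """Sum the layover durations across all intermediate airports."""
    return sum(stop["duration_min"] for stop in itinerary.get("layovers", []))

record = json.loads(
    '{"stops": 1, "layovers": [{"airport": "ORD", "duration_min": 90}]}'
)
print(is_nonstop(record))             # False
print(total_layover_minutes(record))  # 90
```

Adding the layover total to the in-air time of each leg gives the end-to-end travel time.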
Flight
baggage_policy
Carry-on and checked bag allowances with associated fees
per booking class and airline. Scraped from airline policy pages and OTA baggage detail sections —
critical for total trip cost calculations.
{"carry_on": "free", "checked_1": 35.00}
Hotel
hotel_name / property_type
Full property name and classification (hotel, motel,
boutique, resort, hostel, serviced apartment). Normalized across OTAs to provide consistent naming
despite platform-level inconsistencies.
"The Ritz-Carlton" / "luxury_resort"
Hotel
star_rating / guest_score
Official star classification and platform-specific guest
review score (normalized to a 0–10 scale). Enables apples-to-apples comparisons across Booking.com, Expedia,
Hotels.com, and TripAdvisor.
{"stars": 4, "guest_score": 8.7}
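One plausible way to put platform-native scores on a common 0–10 scale is simple linear rescaling. This is a sketch under that assumption; the source does not state the actual normalization method, and `normalize_guest_score` and `scale_max` are hypothetical names.

```python
def normalize_guest_score(raw_score: float, scale_max: float) -> float:
    """Linearly rescale a platform-native review score onto 0-10."""
    if scale_max <= 0:
        raise ValueError("scale_max must be positive")
    if not 0 <= raw_score <= scale_max:
        raise ValueError("raw_score is outside the platform's scale")
    return round(raw_score / scale_max * 10, 1)

# A score reported on a 5-point scale and one already on a 10-point scale.
print(normalize_guest_score(4.35, 5))
print(normalize_guest_score(8.7, 10))
```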
Hotel
amenities
Standardized list of 60+ amenity flags including pool,
gym, spa, pet-friendly, EV charging, airport shuttle, free breakfast, and rooftop bar — parsed from
unstructured property description pages.
["pool", "gym", "spa", "pet_friendly", "free_wifi"]
Hotel
room_types / max_occupancy
All available room configurations (king, twin, suite,
accessible) with bed counts, square footage, view type, and maximum occupancy. Paired with per-room
pricing to power search and filter engines.
{"type": "king_suite", "sqft": 620, "max_occupancy": 3}
Hotel
cancellation_policy
Free cancellation deadline, partial refund windows, and
non-refundable flags per rate. Scraped per room type and OTA platform, because policies vary
significantly across listings of the same property.
{"free_cancel_before": "2025-06-12", "refundable": true}
Hotel
availability_calendar
30/60/90-day rolling availability view showing open and
sold-out dates with dynamic pricing per date band. Enables demand forecasting, revenue management tools,
and last-minute deal detection.
{"2025-07-04": {"available": false}, "2025-07-05": {"price": 310}}
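A last-minute deal detector might start by finding the lowest-priced open date in the calendar. The sketch below assumes a date is open unless it carries `"available": false`, which is an inference from the example above rather than a documented rule; `cheapest_open_date` is a hypothetical name.

```python
import json

def cheapest_open_date(calendar: dict):
    """Return (date, price) for the lowest-priced open date, or None."""
    open_dates = [
        (date, entry["price"])
        for date, entry in calendar.items()
        # Assumption: a missing "available" key means the date is open.
        if entry.get("available", True) and "price" in entry
    ]
    return min(open_dates, key=lambda pair: pair[1], default=None)

calendar = json.loads(
    '{"2025-07-04": {"available": false}, "2025-07-05": {"price": 310}}'
)
print(cheapest_open_date(calendar))  # ('2025-07-05', 310)
```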
Pricing
price_per_night_usd
Current base rate per night in USD, tracked with
timestamps. Enables real-time rate parity monitoring across OTAs and alerts when a hotel violates
minimum advertised price (MAP) agreements.
289.00
Pricing
ota_price_comparison
Same-property pricing captured simultaneously across
Booking.com, Expedia, Hotels.com, Kayak, and Hotwire. Price spreads of 5–20% are common — this field
makes arbitrage opportunities visible at scale.
{"booking_com": 289, "expedia": 294, "kayak": 285}
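The price spread mentioned above can be computed directly from this field. This is a minimal sketch in which the spread is defined as the gap between the highest and lowest price relative to the lowest; that definition and the name `price_spread` are assumptions.

```python
def price_spread(prices: dict) -> dict:
    """Find the cheapest and most expensive OTA and the percentage spread."""
    cheapest = min(prices, key=prices.get)
    priciest = max(prices, key=prices.get)
    low, high = prices[cheapest], prices[priciest]
    return {
        "cheapest": cheapest,
        "most_expensive": priciest,
        "spread_pct": round((high - low) / low * 100, 2),
    }

result = price_spread({"booking_com": 289, "expedia": 294, "kayak": 285})
print(result)
# {'cheapest': 'kayak', 'most_expensive': 'expedia', 'spread_pct': 3.16}
```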
Pricing
taxes_and_fees
Itemized breakdown of resort fees, city taxes, cleaning
fees (vacation rentals), and OTA service charges — revealing true total cost that base-rate comparisons
consistently obscure.
{"resort_fee": 45, "city_tax": 22.50, "service_fee": 18}
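To illustrate how the itemized fees change a comparison, the sketch below adds them to the base rate. It assumes the base rate is per night and every itemized fee is charged once per stay; the source does not say whether a given fee is per night or per stay, so treat `total_stay_cost` as a hypothetical helper.

```python
def total_stay_cost(price_per_night: float, nights: int, fees: dict) -> float:
    """Base rate times nights, plus every itemized fee charged once per stay."""
    if nights < 1:
        raise ValueError("nights must be at least 1")
    return round(price_per_night * nights + sum(fees.values()), 2)

fees = {"resort_fee": 45, "city_tax": 22.50, "service_fee": 18}
print(total_stay_cost(289.00, 1, fees))  # 374.5
print(total_stay_cost(289.00, 2, fees))  # 663.5
```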
Pricing
price_history
Timestamped historical rate snapshots going back up to 24 months.
Track seasonal pricing patterns, advance purchase discount curves, and promotional pricing events for
any property across the US market.
[{"date": "2025-01", "price": 210}, {"date": "2025-06", "price": 289}]
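Seasonal movement can be read off the history as a percentage change between the earliest and latest snapshots. This is a sketch only: it assumes `date` values are `YYYY-MM` strings that sort chronologically, and `pct_change` is a hypothetical name.

```python
def pct_change(history: list) -> float:
    """Percentage change in price from the earliest to the latest snapshot."""
    if len(history) < 2:
        raise ValueError("need at least two snapshots")
    # YYYY-MM strings sort chronologically, so a plain sort is enough here.
    ordered = sorted(history, key=lambda snap: snap["date"])
    first, last = ordered[0]["price"], ordered[-1]["price"]
    return round((last - first) / first * 100, 2)

history = [{"date": "2025-01", "price": 210}, {"date": "2025-06", "price": 289}]
print(pct_change(history))  # 37.62
```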
OTA
ota_listing_url
Direct listing URL on each OTA platform including
Booking.com, Expedia, Hotels.com, Priceline, and Travelocity — enabling deep-link integration into
comparison tools and price alert platforms.
"https://booking.com/hotel/us/midtown-grand..."
OTA
flash_deals / member_rates
Time-limited flash sale prices, loyalty member rates,
and promo codes scraped as they appear. Each deal includes the expiry timestamp, discount percentage,
and original reference price.
{"discount": "22%", "expires": "2025-06-07T23:59:00Z"}
Vacation Rental
listing_type / bedrooms
Property type (entire home, private room, shared room),
bedroom count, bathroom count, and sleeping capacity. Scraped from Airbnb, VRBO, and Vacasa with
consistent field normalization.
{"type": "entire_home", "bedrooms": 3, "bathrooms": 2}
Vacation Rental
host_rating / superhost_status
Host-level review score, total reviews, response rate,
and superhost/premier host badge status. Key signal for trust-scoring models in rental marketplaces and
fraud detection pipelines.
{"score": 4.95, "superhost": true, "response_rate": "99%"}
Reviews
review_sentiment_score
Aggregated sentiment score derived from structured
parsing of guest review text, with subscores by category (cleanliness, location, value, service,
and amenities) for building NLP-ready travel datasets.
{"overall": 8.7, "cleanliness": 9.1, "location": 9.4}
Location
geo_coordinates
Verified latitude/longitude with sub-50m accuracy.
Includes Google Plus Codes and neighborhood classification for mapping, proximity analysis, and
geofenced market segmentation.
{"lat": 40.7549, "lng": -73.984}
Location
address / zip / state
Full standardized US address normalized to USPS
standards including street, city, county, state, and ZIP+4. Enables reliable data matching,
deduplication, and geographic market segmentation.
"315 W 42nd St, New York, NY 10036"
Location
nearby_attractions / poi_distance
Distance in miles to top-ranked POIs including airports,
convention centers, beaches, and landmarks. Scraped and geo-validated for all 50 US states to power
"near me" and proximity filter features.
{"jfk_airport_mi": 14.2, "times_square_mi": 0.4}
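A straight-line POI distance can be derived from two coordinate pairs with the haversine formula, sketched below. The JFK coordinates are approximate values supplied for illustration, not taken from the dataset, and a great-circle result will generally differ from a driving or otherwise validated distance such as the one in the example above.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_MI = 3958.8  # mean Earth radius in miles

def haversine_miles(lat1: float, lng1: float, lat2: float, lng2: float) -> float:
    """Great-circle distance in miles between two latitude/longitude points."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlambda = radians(lng2 - lng1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_MI * asin(sqrt(a))

hotel = (40.7549, -73.984)  # geo_coordinates example from this table
jfk = (40.6413, -73.7781)   # approximate airport location (assumption)
print(round(haversine_miles(*hotel, *jfk), 1))
```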
Travel Intelligence
visa_requirements
Entry visa requirements and travel document rules based on traveler nationality and destination. Supports international travel compliance workflows.
{"visa_required": true, "evisa_available": true}