Flight 航班
flight_number / carrier_code
航班号 / 航空公司代码
IATA airline code and flight number scraped in real-time from all Chinese airline websites and OTA aggregators. Covers Air China (CA), China Eastern (MU), China Southern (CZ), Hainan Airlines (HU), Xiamen Air (MF), Shenzhen Airlines (ZH), and all 40+ CAAC-licensed domestic carriers across 260+ Chinese airports.
"CA1234" / "CA" (Air China 中国国际航空)
Flight 航班
origin / destination
出发地 / 目的地
IATA airport codes and Chinese city names (English + Simplified Chinese), normalised across China's 260+ civil airports including major hubs — Beijing Capital (PEK), Beijing Daxing (PKX), Shanghai Pudong (PVG), Shanghai Hongqiao (SHA), Guangzhou Baiyun (CAN), Shenzhen (SZX), and Chengdu Tianfu (TFU).
{"origin": "PEK", "origin_zh": "北京首都", "destination": "PVG"}
Flight 航班
departure_datetime / arrival_datetime
出发时间 / 到达时间
Scheduled and actual timestamps in CST (UTC+8) and UTC. China operates a single unified timezone across its vast territory — no daylight saving adjustment required. Includes real-time delay status and Civil Aviation Administration of China (CAAC) ATC hold codes for slot-constrained airports like PEK, PVG, and CAN.
"2025-10-01T08:00:00+08:00" (CST, Golden Week)
Flight 航班
fare_class / cabin
舱位等级 / 客舱
Booking class code and cabin type with seat availability counts. Tracks China Eastern's Business vs. Economy bundles alongside budget carrier (9 Air, Ruili Airlines) pricing, and Air China's full-service First/Business/Premium Economy tiers — enabling cross-carrier yield analysis across China's 40+ domestic airlines on a unified schema.
{"class": "Y", "cabin": "economy", "seats_left": 8}
Flight 航班
caac_slot_restriction
时刻表限制
CAAC slot-restriction flag for slot-controlled Chinese airports (Beijing PEK/PKX, Shanghai PVG/SHA, Guangzhou CAN). Slot-controlled status directly impacts schedule reliability, delay probability, and capacity availability — a China-specific signal critical for corporate travel and insurance risk models.
{"airport": "PEK", "slot_controlled": true, "on_time_pct_90d": 68.2}
Flight 航班
chunyun_demand_flag
春运需求标志
Spring Festival travel rush (春运, Chunyun) demand flag — the world's largest annual human migration, moving 3B+ trips over 40 days. Flags fare surge windows, sold-out status, and premium fare multipliers by route during Chunyun, China's most extreme travel demand event with no equivalent in any other country's travel data.
{"chunyun_window": true, "surge_mult": 3.2, "route_sold_out": true}
Hotel 酒店
hotel_name_en / hotel_name_zh
酒店名称(英文/中文)
Full property name in both English and Simplified Chinese, normalised across Ctrip, Qunar, Fliggy, Meituan, and Booking.com. Dual-language field is a China-specific requirement as Chinese OTA names frequently differ from international brand names in ways that cause deduplication failures without explicit bilingual matching.
{"en": "The Peninsula Shanghai", "zh": "上海半岛酒店"}
Hotel 酒店
china_star_rating / ota_score
中国星级评定 / OTA评分
Official China National Tourism Administration (CNTA) star rating (1–5 stars, with Diamond ratings) alongside platform-specific guest scores from Ctrip (4.8/5 scale), Meituan, and Qunar. CNTA ratings differ materially from international star systems — a critical normalisation layer for cross-border hotel comparison products.
{"cnta_stars": 5, "cnta_diamond": true, "ctrip_score": 4.9}
Hotel 酒店
amenities
设施与服务
Standardised list of 70+ amenity flags including China-specific features — UnionPay acceptance, WeChat Pay / Alipay QR at property, in-room Chinese TV channels, 24hr hot water, toothbrush/slipper kit (ubiquitous in China), airport shuttle, mahjong room, karaoke suite, and local government-required safety amenities.
["wechat_pay", "alipay", "airport_shuttle", "karaoke_room", "24hr_hot_water"]
Hotel 酒店
vat_6pct_breakdown
增值税明细
China's 6% VAT (增值税) itemisation per rate — base room rate, VAT amount, and total in CNY. Includes city-specific tourism development fund surcharges where applicable. VAT input credit is reclaimable by Chinese corporate travellers, making itemised VAT data essential for enterprise T&E reimbursement and tax compliance tools.
{"base_cny": 3800, "vat_6pct": 228, "total_cny": 4028}
Hotel 酒店
room_types / bed_configuration
房型 / 床位配置
All available room configurations with Chinese-specific naming conventions (Standard Room 标准间, Deluxe Room 豪华间, Executive Room 行政间, Suite 套房) plus bed type (twin, king, queen), view, floor level, and in-room internet type (wired LAN vs. WiFi). Bed type normalisation is critical as Chinese hotel naming conventions differ significantly from Western standards.
{"type": "豪华大床房", "bed": "king", "view": "bund_view", "floor": "high"}
Hotel 酒店
foreigner_registration_flag
外籍住客登记标志
Indicates whether a property is licensed to accept foreign passport holders — a China-specific compliance requirement under Public Security Bureau (PSB) regulations. Many budget hotels and B&Bs in China cannot legally accommodate foreigners. This field is critical for international travel tools targeting inbound China travellers.
{"accepts_foreigners": true, "psb_registered": true}
High-Speed Rail 高铁
train_number / train_type
车次 / 列车类型
China Railway train number and official type classification scraped from 12306 (China's national rail booking system) and third-party rail apps. Types include G-trains (高速动车组, 350km/h+), D-trains (动车组, 250km/h), C-trains (城际, intercity), K-trains (快速, conventional express), and T-trains (特快, express) across China's 42,000km HSR network.
{"train_no": "G1", "type": "G", "max_speed_kmh": 350}
High-Speed Rail 高铁
seat_class_availability
座席等级可用性
Real-time seat availability by class — Business Class (商务座), First Class (一等座), Second Class (二等座), and No Seat (无座) — refreshed every 10 minutes from 12306. Tracks both available seat counts and standing-ticket availability during high-demand periods. Essential for the world's most used rail booking system (1B+ annual transactions).
{"business": 3, "first_class": 0, "second_class": 42, "no_seat": 88}
High-Speed Rail 高铁
ticket_release_schedule
放票时间表
12306 ticket release window — Chinese rail tickets open 15 days in advance (extended to 30 days during peak periods). This field tracks the exact release timestamp per train and date, enabling ticket alert products to fire notifications at the precise moment tickets become purchasable — critical during Chunyun when G-train tickets sell out within seconds.
{"release_date": "2025-09-16T08:00:00+08:00", "advance_days": 15}
High-Speed Rail 高铁
hsr_fare_cny
高铁票价(人民币)
Official China Railway fare in CNY per seat class and distance. HSR fares are government-regulated with a published base rate but variable discounts (打折票) of 75%, 80%, and 90% of full price — scraped in real-time to capture discount ticket windows unique to the Chinese rail pricing mechanism.
{"second_class": 553, "first_class": 933, "discount_available": "80%"}
OTA Platform 平台
ota_listing_url
平台链接
Direct listing URL on each major Chinese OTA platform — Ctrip (携程), Qunar (去哪儿), Fliggy (飞猪, Alibaba), Meituan (美团), eLong (艺龙), and Booking.com China. Deep-link integration enables Chinese price comparison tools and WeChat Mini Program travel alerts without manual URL construction.
"https://hotels.ctrip.com/hotel/4219.html"
Super-App Integration 超级应用
wechat_miniprogram_id / alipay_id
微信小程序 / 支付宝
WeChat Mini Program deep-link ID and Alipay Mini Program ID for the property or flight — enabling travel products built on China's super-apps to link directly into booking flows. Unlike the open web, most Chinese travel transactions occur inside WeChat or Alipay ecosystems, making these IDs critical for China-market product development.
{"wechat_mp": "wx_ctrip_hotel_4219", "alipay_mini": "fliggy_htl_sha_4219"}
Pricing 价格
price_per_night_cny
每晚价格(人民币)
Current base rate per night in CNY (incl. 6% VAT), tracked with CST timestamps for real-time rate parity monitoring across Chinese OTA channels. Also available in USD and HKD for cross-border travel products. Price tracking captures both regular rates and platform-exclusive "秒杀" (flash kill) discount prices unique to Chinese OTAs.
{"cny_inc_vat": 4028, "usd_approx": 555, "hkd_approx": 4340}
Pricing 价格
ota_price_comparison
OTA价格比较
Same-property pricing captured simultaneously across Ctrip, Qunar, Fliggy, Meituan, eLong, and the hotel's direct booking channel. Chinese OTA price spreads of 10–30% are common due to platform coupons (优惠券), member prices (会员价), and point redemption offers — this field normalises all to CNY cash equivalent for true comparison.
{"ctrip": 3800, "qunar": 3650, "fliggy": 3720, "meituan": 3590}
Golden Week 黄金周
golden_week_demand_calendar
黄金周需求日历
China's Golden Week (黄金周) demand calendar with hotel inventory drawdown and fare surge data — covering National Day Golden Week (Oct 1–7), Spring Festival Golden Week (variable lunar), and May Day (May 1–5). Chinese hotel and flight prices surge 4–8x during Golden Week; this field provides historical ADR uplift by city and property category dating back 24 months.
{"event": "National_Day_GW_2025", "adr_uplift_pct": 520, "inventory_pct_sold": 94}
Location 位置
province / city / district
省 / 市 / 区
Full Chinese administrative hierarchy — province (省), prefecture-level city (地级市), county-level city (县级市), district (区), and sub-district (街道) — in both English and Simplified Chinese. Covers all 34 provincial-level divisions including 23 provinces, 5 autonomous regions, 4 municipalities (Beijing, Shanghai, Tianjin, Chongqing), and 2 SARs (Hong Kong, Macau).
{"province": "上海市", "city": "Shanghai", "district": "黄浦区 Huangpu"}
Location 位置
geo_coordinates / gaode_id
地理坐标 / 高德ID
Verified WGS-84 and GCJ-02 (Mars Coordinate System — China's mandatory offset system) latitude/longitude plus AutoNavi (高德, Gaode) POI ID. GCJ-02 coordinates are required for all China-market mapping integrations on Baidu Maps, Gaode Maps, and WeChat Location Services — WGS-84 alone is insufficient for accurate China geo-mapping.
{"wgs84": [31.240, 121.489], "gcj02": [31.241, 121.495], "gaode_id": "B0FFHNTF"}
Reviews 评价
review_sentiment_score
评论情感分析
Aggregated sentiment score from structured parsing of Simplified Chinese and English guest reviews across Ctrip, Meituan, Qunar, and Dianping. Subscore breakdowns by cleanliness (干净度), location (位置), service (服务), value (性价比), and food (餐饮) — enabling NLP-ready Chinese travel datasets for AI model training with proper UTF-8 Chinese text handling.
{"overall": 4.9, "value_for_money": 4.1, "service": 5.0, "cleanliness": 4.8}
Visa & Travel Compliance 签证与出入境
visa_requirement / transit_policy
签证要求 / 过境政策
Visa requirement, visa-free eligibility, transit-without-visa (TWOV) status, and permitted stay duration for inbound and outbound China travel. Covers China's 240-hour visa-free transit program, Hainan visa-free entry scheme, and bilateral visa-waiver agreements. Essential for OTA platforms, corporate travel tools, and international visitor planning where entry rules vary significantly by nationality and port of entry.
{
"visa_free_transit": true,
"max_stay_hours": 240,
"entry_city": "Shanghai",
"nationality": "Germany"
}