Case Study: Travel Booking Platform
For this travel booking platform, fraud was hiding in complexity. International bookings, third-party processors, and large-ticket transactions all made detection harder. While their model could flag some risk, it lacked the signal resolution needed to separate false positives from real threats.
Elephant brought clarity to the edge cases. The team improved fraud detection, reduced false positives, and gained the confidence to act on real trust rather than just red flags.
													This platform processes hundreds of thousands of flight bookings per month, including many that pass through third-party processors. High-value, high-risk bookings, like international or premium-class tickets, carried a higher risk. However, the system in place was not calibrated to identify who could actually be trusted.
False positives drained revenue. Missed fraud created chargebacks and policy abuse. Analysts lacked the signal precision needed to tune the system without tradeoffs. The company needed a way to improve model performance at both ends of the spectrum. Their priority was high-impact bookings where decisioning was most opaque.
High-trust users were scored poorly, not because of actual red flags, but because key signals were missing or unavailable
External processors stripped core user signals, making it harder to link booking data to a coherent identity and limiting the accuracy of risk scoring
Fraud was often flagged after booking confirmation, but by then the platform had already issues tickets, absorbed the cost, and couldn't intervene
Attributes like email and address weren't just weak, they were incomplete or unverifiable. The system couldn't interpret that gap, leaving fraud undetected
Elephant calibrated the trust score using 165,000 historical flight bookings. Instead of relying on perfect input data, the model was trained to recognize meaningful patterns in obscured or fragmented signals. It performed especially well on traffic routed through third-party processors, where identity clarity was lowest. The team tested a range of scoring thresholds to capture both high-risk and high-trust segments with precision.
											
											The breakthrough wasn't just in performance, but also in flexibility. Elephant gave the team a model that could make smart, confident decisions even when key attributes were missing or inconsistent.
Instead of defaulting to caution, analysts could act on trust scores that accounted for processor-level noise, international bookings, and limited visibility. This meant fraud was caught earlier, approvals moved faster, and the system adapted to complexity without overreacting to it.
ROC-AUC reached 0.87
Good detection reached 53% at an 850 threshold
Fraud detection reached 37% at a 250 threshold
False negative rate dropped to 1.6% at a 950 threshold