REAL TIME FRAUDULENT WEB BEHAVIOR DETECTION
Jeff Niemann
Randal Hanak
Jeff Niemann
● Analyzing behavior at NeuroID for
4 years
● Focus on capturing behaviors
indicative of fraud
● Working with JavaScript and
Apache Flink
Randy Hanak
● Been with NeuroID for 1 year.
● Focusing on reliability,
performance, and scaling our
Flink pipeline as well as
downstream consumers.
● Also deployment and
developer experience in Flink
repositories.
Agenda
About NeuroID
Behavior Detection in Flink
High Level Architecture
Challenges and Solutions
Demo of ID Orchestrator
Large Scale Real Time Fraudulent Web Behavior Detection
The Digital
Identity Crisis
is the defining challenge of
modern financial services.
4.5M fake users
Avis, Hertz refuse Chime payments
Selectively bans bank transfers
Post-submit data is causing the crisis
Critical identity decisions are all made post-submit
Pre-submit data enhances all other data
Maximize the identity investments you’ve already made using pre-submit
Architecture Diagram
Web Page Behaviors
Genuine
- Know their personal information
Risky (Fraudulent)
- Don’t know personal information
Bots
- Rapidly filling form
High risk identity
1. Importing first & last name into the First Name field
2. Cutting the last name out of the First Name field (a fraudster efficiency trick)
3. Filling the form out of order – entering fields in the order the stolen data is stored rather than the form order
4. Navigating off the application & back onto the SSN field (looking up info)
5. Hesitation throughout street address entry – utilizing Short-Term Memory (looking up info in chunks)
Risky Behaviors
Confidential & Proprietary Limited Distribution
Genuine identity
Confidential & Proprietary Limited Distribution
1. Navigating the form with focus
2. No mistakes entering personal information
3. Personal details entered via Long-Term Memory – no pauses or hesitation
Genuine Behaviors
Bots/automated activity - what are they?
Automation of the onboarding process
● Automatic scripts that run through the form entering data
Why would you use a bot?
● For making accounts
● For prefill style attacks (insurance) - mining data off the page
What do they look like behaviorally?
Unusual/headless browsers
Extremely fast typing – on the order of 1000x faster than a human
Consistent typing
Fast transitions
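As a rough illustration of these behavioral signals (a hypothetical heuristic, not NeuroID's actual model – all names and thresholds here are assumptions), superhuman speed and metronome-like consistency can be scored from inter-keystroke gap statistics:

```python
# Illustrative sketch: flag sessions whose inter-keystroke gaps are both
# implausibly fast and implausibly consistent. Thresholds are made up.
from statistics import mean, pstdev

def looks_like_bot(keystroke_times_ms, min_gap_ms=30.0, max_cv=0.15):
    """keystroke_times_ms: ascending timestamps of key events in one field."""
    gaps = [b - a for a, b in zip(keystroke_times_ms, keystroke_times_ms[1:])]
    if len(gaps) < 5:
        return False  # not enough signal to decide
    avg = mean(gaps)
    cv = pstdev(gaps) / avg if avg else 0.0  # coefficient of variation
    # Humans type with gaps well above ~30 ms and noticeable variance;
    # scripted input is fast AND metronome-consistent.
    return avg < min_gap_ms and cv < max_cv

# A script firing a key every 5 ms is flagged; a human cadence is not.
bot = looks_like_bot([i * 5 for i in range(20)])           # True
human = looks_like_bot([0, 140, 310, 420, 640, 780, 990])  # False
```

A real detector would combine many such signals (browser fingerprint, field transitions, mouse movement) rather than typing cadence alone.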
Behaviors
Flink Keyed Processors
Confidential & Proprietary Limited Distribution
Flink Keyed Processors
Purpose: Group and annotate behaviors that are
indicative of fraud so that analytical tools can
quickly generate an outcome.
Rate Limiting Events
● Cleaning data as early as possible
● Expiring state on an interval
Aggregating Behavior
● Track types of fields interacted with
● Compare click events throughout session
● Group of events that represent a behavior
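The rate-limiting idea above can be sketched roughly as follows. This is a hypothetical Python sketch of the logic only – the production pipeline uses Flink keyed process functions (in Java) with state TTL, and the class and parameter names here are illustrative:

```python
# Sketch: cap per-session event rates to human speed and expire idle
# session state on an interval, mimicking keyed state with a TTL.
import time

class SessionRateLimiter:
    def __init__(self, max_events_per_sec=25, state_ttl_sec=900):
        self.max_rate = max_events_per_sec
        self.ttl = state_ttl_sec
        self.state = {}  # session_id -> (window_start, count, last_seen)

    def allow(self, session_id, now=None):
        now = time.time() if now is None else now
        start, count, _ = self.state.get(session_id, (now, 0, now))
        if now - start >= 1.0:          # roll the one-second window
            start, count = now, 0
        count += 1
        self.state[session_id] = (start, count, now)
        return count <= self.max_rate   # drop superhuman bursts early

    def expire(self, now=None):
        """Run on an interval (like a Flink timer) to drop idle sessions."""
        now = time.time() if now is None else now
        idle = [s for s, (_, _, seen) in self.state.items() if now - seen > self.ttl]
        for sid in idle:
            del self.state[sid]
```

Cleaning data this early keeps the downstream aggregation operators from carrying state for bot floods.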
Database Sink
Send grouped events to a database
We will discuss these challenges and solutions further in the next section, on scaling and speed.
Scaling and Speed
Confidential & Proprietary Limited Distribution
Scaling and Speed
● Score generated
within 3 seconds
of first interaction
on page
● Score continually
updated with
interactions
Scaling and Speed
● 161 million events processed per day.
● Events aggregated into sessions.
● Sessions processed within the last 15 minutes.
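A quick back-of-the-envelope check on the stated volume (my arithmetic, not a figure from the talk): 161 million events per day averages out to roughly 1,900 events per second, with peaks necessarily higher.

```python
# Average sustained event rate implied by 161M events/day.
events_per_day = 161_000_000
seconds_per_day = 86_400
avg_per_sec = events_per_day / seconds_per_day  # ~1,863 events/sec
```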
Scaling and Speed
How we scale downstream consumers.
● Kinesis trigger with batch
● S3 with ObjectCreated event
● Asynchronous invocation from Flink to get the required concurrency.
Scaling and Speed
Scaling and Speed
Kinesis consumer
● Record size limit: 1 MB.
● Kinesis with the standard iterator: the Lambda service polls each shard in your
stream once per second for records over HTTP. With a batch window of 0, one
consumer can achieve about 200 ms data-retrieval latency, since a single
consumer can read up to 5 times per second per shard.
● A dedicated-throughput consumer with enhanced fan-out can achieve roughly
70 ms latency. Stream consumers use HTTP/2 to push records to Lambda over
a long-lived connection.
Scaling and Speed
S3 consumer
● Record size is not a problem.
● Downstream consumers are triggered by the ObjectCreated event,
which is an asynchronous Lambda invoke, allowing easy scaling.
● The downside: for latency, you want all relevant data packed into
one S3 file, which requires accumulating that data in Flink state.
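The accumulate-then-flush tradeoff for S3 can be sketched like this (a minimal Python sketch with assumed helper names – the real pipeline holds the buffer in Flink keyed state and writes via the S3 API):

```python
# Sketch: buffer each session's events so all relevant data lands in a
# single S3 object, at the cost of holding it in state until the flush.
import json

class SessionAccumulator:
    def __init__(self, put_object, flush_after=5):
        self.put_object = put_object      # would wrap s3.put_object in real life
        self.flush_after = flush_after    # flush condition: count, timer, etc.
        self.buffers = {}                 # session_id -> list of events

    def on_event(self, session_id, event):
        buf = self.buffers.setdefault(session_id, [])
        buf.append(event)
        if len(buf) >= self.flush_after:
            key = f"sessions/{session_id}.json"
            self.put_object(key, json.dumps(buf))  # one object per session
            del self.buffers[session_id]           # clear state after flush
            return key
        return None
```

The ObjectCreated trigger then fires once per session file, so the downstream Lambda sees the whole session in one invoke.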
Accumulating session in state
Summary: What Worked Well?
● The deciding factor is the size of our messages.
● When messages are small and don't require the context of
previous messages in the session, the decision is
easier – either Kinesis or S3 works.
● Our messages require the context of the whole session.
Scaling and Speed
Database consumer
● Record size: 200 KB.
● The downstream consumer can be triggered by Flink with an asynchronous invoke
using the Lambda API.
● Consumers are then given an ID and a range of items to read from the
database.
● We partition data larger than 200 KB into items in a database table.
This lets us read backwards through a session and grab the relevant records
quickly in the Lambda.
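The partitioning scheme can be sketched as follows. This is a hypothetical illustration – the item and attribute names (`pk`, `sk`, `part#...`) are my assumptions, not the actual table schema:

```python
# Sketch: split a record larger than ~200 KB into numbered items under the
# session's partition key, so a consumer can query the session key range
# and reassemble the record quickly.
CHUNK_BYTES = 200 * 1024

def to_items(session_id, payload: bytes):
    """Split one record into ordered DB items of at most CHUNK_BYTES each."""
    chunks = [payload[i:i + CHUNK_BYTES] for i in range(0, len(payload), CHUNK_BYTES)]
    return [
        {"pk": session_id, "sk": f"part#{n:04d}", "body": chunk}
        for n, chunk in enumerate(chunks)
    ]

def from_items(items):
    """Reassemble by sort key, as read back from a key-range query."""
    return b"".join(i["body"] for i in sorted(items, key=lambda i: i["sk"]))
```

Zero-padded part numbers keep lexicographic sort-key order equal to chunk order, so a backwards range read returns the newest parts first.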
Scaling and Speed
Summary
● Pick your transport based on latency requirements, message size, and
required concurrency.
● Use an async keyed process function to store records to the database.
● Invoke the Lambda from Flink, scaling it to reduce latency.
Monitoring and Alerting
Confidential & Proprietary Limited Distribution
Monitoring and Alerting
Identifying issues
Monitoring and Alerting
Issues resolved
Confidential & Proprietary Limited Distribution
Monitoring Size
Blue Green Deployments
Confidential & Proprietary Limited Distribution
Blue/Green Deployments
Deploying new
versions of our Flink
processors without
interrupting current
sessions.
Blue/Green Deployments
Additional details
● Properly keeping the mapping of clients to streams and
providing a default.
● Keeping latency low while maintaining this mapping.
● Metrics for finding any bugs – we found issues that only
showed up during a traffic switch.
Final Learnings
Confidential & Proprietary Limited Distribution
What worked well
● Transition from the Kryo serializer to the POJO serializer
What worked well
● Avoid transferring uncompressed records over the network
● Ensure functions end up in the same operator group so that
records can be passed efficiently between them. Partition key and
parallelism are among the requirements for being in the same group.
PyFlink Learnings
● Initially worked with PyFlink because we were
comfortable with Python
● PyFlink wasn't a good choice for performance
reasons
● Deploying with Python on KDA (Kinesis Data Analytics)
also wasn't an option
Demo
Confidential & Proprietary Limited Distribution
Thank you
neuro-id.com


Editor's Notes

  • #2: Jeff and Randy to introduce themselves
  • #3: Randy Business logic abuse, vulnerability probing
  • #5: Randy to go over Agenda
  • #6: The industry-redefining behavioral analytics company applies patented neuroscience technology to measure how familiar users are with their inputted PII before they click ‘submit’ and enter a company’s fraud stack. NeuroID analyzes this pre-submit data in real-time and determines if users are genuine or risky, without adding any friction.
  • #7: Jeff What do we do at NeuroID? Additional stats: identity theft was a $721 billion problem in 2021. Citation: https://aite-novarica.com/report/us-identity-theft-stark-reality Change this slide to be more focused around our public customers
  • #8: Jeff SLIDE PURPOSE: Most applications make decisions on post submit data
  • #9: Jeff SLIDE PURPOSE: We now can add presubmit behavioral data to help funnel applicants. WHAT behaviors are we trying to detect? Risky Applicants Genuine Applicants Bots
  • #10: Jeff SLIDE PURPOSE: Details components in our architecture Highlight peripheral services around Flink. Very Brief! Remake diagram to be v3 diagrams
  • #12: Jeff SLIDE PURPOSE: Visually conceptualize how Neuro-ID identifies a fraudulent applicant. Now that you have a concept of genuine behaviors, let's watch this session and call out any bad behaviors
  • #13: Jeff SLIDE PURPOSE: Visually conceptualize how Neuro-ID identifies a genuine applicant. Now that you have the concept for the types of data we collect - I wanted to show an example of genuine behaviors For example, when I give my address: xxxxxxx, I can easily do it fluently, without any pauses or hesitation, using my long term memory When I type that info, I’m using that same muscle memory to type just as I spoke – fluently, without hesitation
  • #15: Randy
  • #16: Randy
  • #17: Randy Highlight here is that they filled out 5 fields in 3 seconds while mousing in between the fields!! That’s nuts!
  • #19: Jeff SLIDE PURPOSE: More details on how we utilize Flink. Stick to high level for services. Deaggregator - break event packets down. Rate Limiter - limit certain events to human speed. Session Metadata Tracker - track high-level details about the session. Target Type Annotator - track types of fields interacted with. Click Annotator - compare click events throughout the session. Segmenter - group events that represent a behavior. DynamoDB Sink - send grouped events to be scored.
  • #20: Jeff SLIDE PURPOSE: Discuss challenge of deploying changes to pipeline with active streams. Possibly break out to sink slide
  • #24: Jeff SLIDE PURPOSE: Discuss challenge of making pipeline very fast Metrics around timing Need more chart / numbers
  • #25: Jeff SLIDE PURPOSE: Discuss challenge of making pipeline very fast Add metrics around timing (Flink / Computing) Scaling dynamically with sessions
  • #26: Randy SLIDE PURPOSE: Discuss challenge of making pipeline very fast Add metrics around timing (Flink / Computing) Scaling dynamically with sessions Kinesis with Standard Iterator. Lambda service polls each shard in your stream one time per second for records using HTTP protocol. With batch window 0, can achieve 200-millisecond data retrieval latency for one consumer. Given one consumer can read up to 5 times per second per shard. Dedicated-throughput consumer with enhanced fan out that can achieve ~70 ms of latency. Stream consumers use HTTP/2 to push records to Lambda over a long-lived connection.
  • #27: Randy SLIDE PURPOSE: Discuss challenge of making pipeline very fast Add metrics around timing (Flink / Computing) Scaling dynamically with sessions
  • #28: Randy S3/Dynamo/redis to reduce costs in the size of our cluster Favored costs over accumulation in state.
  • #29: Randy
  • #30: Randy SLIDE PURPOSE: Discuss challenge of deploying changes to pipeline with active streams. Possibly break out to sink slide
  • #31: Randy
  • #32: Randy SLIDE PURPOSE: Discuss challenge of making pipeline very fast Add metrics around timing (Flink / Computing) Scaling dynamically with sessions
  • #34: Jeff Found backpressure in certain keyed process functions. Allowed us to iterate to make them work smoother
  • #36: Jeff SLIDE PURPOSE: Discuss how we monitor size of sessions.
  • #41: Randy SLIDE PURPOSE: Discuss challenge of deploying changes to pipeline with active streams. Diagram show not tell
  • #43: Jeff SLIDE PURPOSE: Discuss challenge of deploying changes to pipeline with active streams. Extra slide
  • #44: Jeff and Randy to introduce Demo
  • #45: Additional stats: identity theft was a $721 billion problem in 2021. Citation: https://aite-novarica.com/report/us-identity-theft-stark-reality