In-Database Machine Learning Service through SQL Server

When designing an ML system, we normally move data out of the database and through a feature engineering pipeline for data transformation and model training. Building and running this data pipeline consumes additional resources. The following image describes the entire process for this type of ML system.

Instead of this design, we can perform data transformation, model training, and scoring inside the database. SQL Server can enable this capability in our ML system, so no separate feature engineering pipeline is needed because no data movement is required. The entire process then looks as follows.

Using this design concept, I am going to solve the following business problem.

Business Problem

Our organization unraveled a case of expense report fraud among several employees.

Employees were allowed to submit expense reports of up to $40 without a receipt, but needed to include a receipt for anything above that amount. Several employees began submitting fraudulent claims under that amount.

Management would like to know how much we believe employees took above and beyond the expected amounts.

To solve this problem, each person's expense amount is predicted using the random forest regression algorithm. There is no need to build a data pipeline here. Instead, the training and inference components are built inside the database: SQL Server stored procedures are used to create both pipelines inside the database.
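The following is a minimal sketch of the Python logic that the training and inference stored procedures might wrap. It assumes SQL Server Machine Learning Services with Python, where `sp_execute_external_script` passes the query result to the script as a pandas DataFrame. The column names (`category_code`, `month`, `amount`), the function names, and the hyperparameters are hypothetical and are not taken from this repository's `training.sql` or `inference.sql`.

```python
import pickle

import pandas as pd
from sklearn.ensemble import RandomForestRegressor

# Hypothetical column names; the real tables may use different ones.
FEATURES = ["category_code", "month"]
TARGET = "amount"


def train_model(input_df: pd.DataFrame) -> bytes:
    """Fit a random forest regressor and return it serialized as bytes.

    Inside a training stored procedure, the bytes could be written to a
    binary column so that the model never leaves the database.
    """
    model = RandomForestRegressor(n_estimators=50, random_state=0)
    model.fit(input_df[FEATURES], input_df[TARGET])
    return pickle.dumps(model)


def score(model_bytes: bytes, input_df: pd.DataFrame) -> pd.DataFrame:
    """Load the serialized model and append a predicted_amount column.

    Inside an inference stored procedure, the returned frame would be
    handed back to SQL Server as the script's output data set.
    """
    model = pickle.loads(model_bytes)
    output_df = input_df.copy()
    output_df["predicted_amount"] = model.predict(input_df[FEATURES])
    return output_df
```

Serializing the fitted model to bytes is one common way to keep it next to the data: the training procedure stores the bytes in a table, and the inference procedure reads them back before scoring new rows.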

EDA

Results

True Value for Expense Amount

Predicted Value for Expense Amount
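Given true and predicted expense amounts, the figure management asked for can be estimated as the sum of the positive gaps between them. The sketch below shows one possible way to do that per employee. The row layout, the function name, and the sample values are illustrative assumptions, not results from this project, and clipping negative gaps to zero is a design choice rather than something the source specifies.

```python
from collections import defaultdict


def excess_by_employee(rows):
    """Sum the amount claimed above the predicted value for each employee.

    rows: iterable of (employee, actual_amount, predicted_amount) tuples.
    Claims at or below the prediction contribute zero.
    """
    totals = defaultdict(float)
    for employee, actual, predicted in rows:
        totals[employee] += max(actual - predicted, 0.0)
    return dict(totals)


# Illustrative values only.
sample_rows = [
    ("emp_a", 39.0, 21.0),
    ("emp_a", 38.5, 20.5),
    ("emp_b", 18.0, 19.0),
]
print(excess_by_employee(sample_rows))
```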


Kelum-senaka/Fraudulent_Claims_Identification
