Community
IBM Community Home
AIOps & Management
Business Analytics
Business Automation
Cloud Pak for Data
Data Science
DataOps
Hybrid Data Management
IBM Japan
IBM Z & LinuxONE
Integration
Internet of Things
Power Systems
Public Cloud
Network Automation
Security
Storage
Supply Chain
Watson Apps
WebSphere & DevOps
Log in
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
on this day
between these dates
Posted by
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
on this day
between these dates
Posted by
Skip to main content (Press Enter).
Sign in
Skip auxiliary navigation (Press Enter).
Data Science
Topic groups
Global Data Science
Decision Optimization
SPSS Modeler
SPSS Statistics
Watson Studio
Data and AI Learning
User groups
Events
Upcoming Events
On Demand Webinars
IBM Expert TV
Virtual Community Events
All IBM Community Events
Participate
Post to Forum
Share a Resource
Blogging on the Community
Connect with Data Science Users
All IBM Community Users
Data Science Elite
Resources
IBM Support
IBM Cloud Support
IBM Champions
Demos
Marketplace
Marketplace
IBM Data Science Community
Master the art of data science.
Complimentary Coursera offer for all new members
Get offer
Join without the offer
Skip main navigation (Press Enter).
Toggle navigation
Content types
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
Date range
on this day
between these dates
Posted by
Global Data Science Forum
View Only
Group Home
Discussion
1.3K
Library
179
Blogs
424
Events
10
Members
16.2K
Back to Blog List
Forecasting with Lead Regression
By
Moloy De
posted
Thu January 14, 2021 09:35 PM
Options Dropdown
Mark as Inappropriate
0
Recommend
We got a pioneering company manufacturing Roof Shingles in Minnesota, US.
Client was progressing towards implementing Industry 4.0 benchmark and developing analytics for their plants is an important activity in it. They
implemented 100’s of sensors along their assembly line that are streaming nano-second data to their Spark Data-lake.
Viscosity of input fluid is an important factor to maintain quality of production.
Data showed there are unwanted peaks (outliers) in viscosity data which client wanted to eliminate. Following are the steps they thought of
1. Monitoring Viscosity continuously in a dashboard at the plant
2. Finding the significant contributors in the fluctuations of the Viscosity values
3. Perform a root cause analysis (RCA) of the unwanted peaks
We implemented SPARK Repository to hold sensor records and displayed them in a plant dashboard. M
ultiple regression in R was used to find out significant contributors in viscosity fluctuations.
Decision Tree was deployed to perform the RCA for the Web Tears (broken roof shingles) and
Lead Regression was used to forecast viscosity. The modelling was successful and was implemented using SparkR.
QUESTION I : How one can detect significant factors from a linear regression?
QUESTION II : How one can detect significant factors from a decision tree analysis?
#forecasting
#Highlights-home
#Highlights
0 comments
1736 views
×
Reason for Moderation
Describe the reason this content should be moderated (required)
Permalink
Data Science
Topic groups
Global Data Science
Decision Optimization
SPSS Modeler
SPSS Statistics
Watson Studio
Data and AI Learning
User groups
Events
Upcoming Events
On Demand Webinars
IBM Expert TV
Virtual Community Events
All IBM Community Events
Participate
Post to Forum
Share a Resource
Blogging on the Community
Connect with Data Science Users
All IBM Community Users
Data Science Elite
Resources
IBM Support
IBM Cloud Support
IBM Champions
Demos
Marketplace
Marketplace
Copyright © 2019 IBM Data Science Community. All rights reserved.
Powered by Higher Logic