Community
Search Options
Search Options
Log in
Skip to main content (Press Enter).
Sign in
Skip auxiliary navigation (Press Enter).
AI and Data Science
Topic areas
AI and DS Skills
Decision Optimization
Embeddable AI
Global AI and Data Science
IBM Advanced Studies
SPSS Statistics
watsonx Assistant
Watson Discovery
User groups
Events
TechXchange Day
IBM TechXchange Conference
Upcoming AI Events
IBM TechXchange Webinars
All IBM TechXchange Community Events
Participate
Gamification Program
Community Manager's Welcome
Post to Forum
Share a Resource
Share Your Expertise
Blogging on the Community
Connect with Data Science Users
All IBM TechXchange Community Users
Resources
IBM TechXchange Group
AI Learning
IBM Champions
IBM Cloud Support
IBM Documentation
IBM Support
IBM Support 101
IBM Technology Zone
IBM Training
TechXchange Day
Marketplace
Marketplace
AI and Data Science
Master the art of AI and Data Science.
Ask a question
Join us for IBM TechXchange Day: AI and Automation
Skip main navigation (Press Enter).
Toggle navigation
Search Options
Global AI and Data Science
Group Navigator
View Only
Community Home
Discussion
2.3K
Library
280
Blogs
754
Events
8
Members
25.6K
Share
Forecasting with Lead Regression
By
Moloy De
posted
Thu January 14, 2021 09:35 PM
0
Like
We got a pioneering company manufacturing Roof Shingles in Minnesota, US.
Client was progressing towards implementing Industry 4.0 benchmark and developing analytics for their plants is an important activity in it. They
implemented 100’s of sensors along their assembly line that are streaming nano-second data to their Spark Data-lake.
Viscosity of input fluid is an important factor to maintain quality of production.
Data showed there are unwanted peaks (outliers) in viscosity data which client wanted to eliminate. Following are the steps they thought of
1. Monitoring Viscosity continuously in a dashboard at the plant
2. Finding the significant contributors in the fluctuations of the Viscosity values
3. Perform a root cause analysis (RCA) of the unwanted peaks
We implemented SPARK Repository to hold sensor records and displayed them in a plant dashboard. M
ultiple regression in R was used to find out significant contributors in viscosity fluctuations.
Decision Tree was deployed to perform the RCA for the Web Tears (broken roof shingles) and
Lead Regression was used to forecast viscosity. The modelling was successful and was implemented using SparkR.
QUESTION I : How one can detect significant factors from a linear regression?
QUESTION II : How one can detect significant factors from a decision tree analysis?
#forecasting
#GlobalAIandDataScience
#GlobalDataScience
#Highlights
#Highlights-home
0 comments
3252 views
Permalink
IBM Community Home
Browse
Discussions
Resources
Groups
Events
IBM TechXchange Conference 2023
IBM Community Webinars
All IBM Community Events
Participate
Gamification Program
Community Manager's Welcome
Post to Forum
Share a Resource
Blogging on the Community
All IBM Community Users
Resources
Community Front Porch
IBM Champions
IBM Cloud Support
IBM Documentation
IBM Support
IBM Technology Zone
IBM Training
Marketplace
Marketplace
AI and Data Science
Topic areas
AI and DS Skills
Decision Optimization
Embeddable AI
Global AI and Data Science
IBM Advanced Studies
SPSS Statistics
watsonx Assistant
Watson Discovery
User groups
Events
TechXchange Day
IBM TechXchange Conference
Upcoming AI Events
IBM TechXchange Webinars
All IBM TechXchange Community Events
Participate
Gamification Program
Community Manager's Welcome
Post to Forum
Share a Resource
Share Your Expertise
Blogging on the Community
Connect with Data Science Users
All IBM TechXchange Community Users
Resources
IBM TechXchange Group
AI Learning
IBM Champions
IBM Cloud Support
IBM Documentation
IBM Support
IBM Support 101
IBM Technology Zone
IBM Training
TechXchange Day
Marketplace
Marketplace
Powered by Higher Logic