Copyright © 2015 KNIME.com AG
A Bioinformatician in the Far Far Away Kingdom, or Two Programmers from the KNIME Magic Chest
Oleg Yasnev
KNIME.com
So you're going to write the code for me, too?
Yep!
Still from the animated film “Vovka in the Far Far Away Kingdom” © Soyuzmultfilm
KNIME.com
• KNIME.com founded in 2008
• Offices in Zurich, San Francisco (Aug ’13), Berlin (May ’14) and Konstanz (October ’15)
• 15 open source releases, 10 product releases (in 2014)
• More than 2 million lines of code
• 600k lines of community code
Broad Range of KNIME Application Areas
• Advanced Analytics
• Pharma
• Health Care
• Finance
• Retail
• Customer Intelligence
• Manufacturing
The KNIME Analytics Platform
From Access to Visualization and Deployment
Data Access
• Databases
– MySQL, PostgreSQL
– any JDBC (Oracle, DB2, MS SQL Server)
• Files
– CSV, TXT
– Excel, Word, PDF
– SAS, SPSS
– XML
– PMML
– Images, texts, networks, chem
• Web, Cloud
– REST, Web services
– Twitter, Google
Big Data
• HDFS support
• Hive
• Impala
• HP Vertica
• In-database processing
Transformation
• Preprocessing
– Row, column, matrix based
• Data blending
– Join, concatenate, append
• Aggregation
– Grouping, pivoting, binning
• Feature Creation and Selection
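As a rough illustration of the aggregation step, the sketch below groups rows by a key and bins a numeric column using only the Python standard library. The column names (`region`, `amount`), the bin edges, and the helper functions are invented for this example; in KNIME these operations are configured in node dialogs rather than written as code.

```python
from bisect import bisect_right
from collections import defaultdict

# Hypothetical input rows; column names are invented for this sketch.
rows = [
    {"region": "north", "amount": 12.0},
    {"region": "south", "amount": 48.5},
    {"region": "north", "amount": 7.5},
    {"region": "south", "amount": 101.0},
]


def group_sum(rows, key, value):
    """Grouping: sum `value` for each distinct `key`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


def bin_label(x, edges, labels):
    """Binning: map a number to the label of the interval it falls into."""
    # bisect_right counts how many edges are <= x, which is the bin index.
    return labels[bisect_right(edges, x)]


totals = group_sum(rows, "region", "amount")
bins = [bin_label(r["amount"], [10.0, 100.0], ["low", "mid", "high"]) for r in rows]
```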
Analyze & Data Mining
• Regression
– Linear, Logistic
• Classification
– Decision tree, ensembles, SVM, MLP, Naïve Bayes
• Clustering
– k-means, DBSCAN, hierarchical
• Validation
– Cross-validation, scoring, ROC
• Misc
– PCA, MDS, item set mining
• External
– R, Weka
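To make the validation bullet concrete, here is a minimal sketch of how k-fold cross-validation partitions row indices into training and test sets. The function name `k_fold_indices` is hypothetical, there is no shuffling or stratification, and this is not KNIME's implementation, only the general idea.

```python
def k_fold_indices(n_rows, k):
    """Yield (train, test) index lists for k-fold cross-validation."""
    # Spread the remainder over the first folds so sizes differ by at most one.
    sizes = [n_rows // k + (1 if i < n_rows % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_rows) if i < start or i >= start + size]
        yield train, test
        start += size


folds = list(k_fold_indices(10, 3))
```

Every row appears in exactly one test fold, so each row is scored once by a model that never saw it during training.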
Visualization
• Interactive
– Scatter plot, histogram, pie charts, box plot
– Highlighting (brushing)
• JFreeChart
• JavaScript
• Misc
– Tag cloud, OpenStreetMap, networks, molecules
• External
– R
Deployment
• Database
• Files
– Excel, CSV, TXT
– XML
– PMML
– to: local, KNIME Server, SSH or FTP server
• BIRT Reporting
Over 1000 native and embedded nodes included:
• Statistics; Data Mining; Machine Learning; Web Analytics; Text Mining; Network Analysis; Social Media Analysis; WEKA; R; Community / 3rd
• MySQL, Oracle, etc.; SAS, SPSS, etc.; Excel, Flat, etc.; Hive etc.; XML, PMML; Text, Doc, Image; Web Crawlers; Industry Specific; Community / 3rd
• ETL; Row, Column, Matrix; Text, Image; Time Series; Java; Python; Community / 3rd
• R; JFreeChart; Community / 3rd
• via BIRT; PMML; XML; Databases; Excel, Flat, etc.; Hive etc.; Text, Doc, Image; Industry Specific; Community / 3rd
KNIME: Integrating Data and Tools
Big Data.
Pre-processing on Hadoop
In-Database Processing
Loads your pre-processed data into KNIME
Reader/Writer
• Table selection
• Load data into KNIME
• Create table as select
• Insert/append data
• Delete rows from table
• Update values in table
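The operations listed above map onto ordinary SQL statements. The following sketch runs each of them against an in-memory SQLite database that stands in for the real database; the table and column names (`readings`, `meter_totals`, `meter_id`, `kwh`) are invented for illustration, and the KNIME nodes generate their own SQL from dialog settings.

```python
import sqlite3

# In-memory SQLite stands in for the real database in this sketch.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (meter_id INTEGER, kwh REAL)")

# Insert/append data
conn.executemany(
    "INSERT INTO readings VALUES (?, ?)",
    [(1, 0.5), (1, 0.75), (2, 1.25), (3, -1.0)],
)

# Delete rows from table (drop an obviously invalid reading)
conn.execute("DELETE FROM readings WHERE kwh < 0")

# Update values in table
conn.execute("UPDATE readings SET kwh = kwh * 1000 WHERE meter_id = 2")

# Create table as select
conn.execute(
    "CREATE TABLE meter_totals AS "
    "SELECT meter_id, SUM(kwh) AS total FROM readings GROUP BY meter_id"
)

# Load data (here into a Python list rather than a KNIME table)
loaded = conn.execute(
    "SELECT meter_id, total FROM meter_totals ORDER BY meter_id"
).fetchall()
```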
Hive/Impala Loader
• Upload a KNIME data table to Hive/Impala
• Part of the commercial Big Data Extension
Manipulation
• Filter rows and columns
• Join tables/queries
• Sort your data
• Write your own query
• Aggregate your data
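The point of in-database manipulation is that filtering, joining, sorting and aggregation run where the data lives, and only the result travels to the client. A minimal sketch of that idea, again with an in-memory SQLite database and invented table names (`meters`, `readings`), might look like this:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE meters (meter_id INTEGER, kind TEXT);
    CREATE TABLE readings (meter_id INTEGER, kwh REAL);
    INSERT INTO meters VALUES (1, 'residential'), (2, 'business'), (3, 'residential');
    INSERT INTO readings VALUES (1, 0.5), (1, 1.5), (2, 4.0), (3, 0.25), (3, 0.25);
""")

# Join, filter, aggregate and sort inside the database; only the small
# result set crosses the connection.
query = """
    SELECT m.meter_id, SUM(r.kwh) AS total_kwh
    FROM readings AS r
    JOIN meters AS m ON m.meter_id = r.meter_id
    WHERE m.kind = 'residential'
    GROUP BY m.meter_id
    ORDER BY total_kwh DESC
"""
result = conn.execute(query).fetchall()
```

Five reading rows go in, two aggregated rows come out; on a table with hundreds of millions of rows the same pattern avoids moving the raw data at all.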
Database GroupBy – Manual Aggregation
Database GroupBy – Type Based Aggregation
Matches all cells
Matches all numeric cells
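Type-based aggregation can be pictured as "apply this aggregation to every column of a given type". The sketch below builds such a GROUP BY query for all numeric columns of a SQLite table. The function `type_based_group_by` and the table `t` are hypothetical, and the way the Database GroupBy node actually resolves column types is not shown in the slides.

```python
import sqlite3


def type_based_group_by(conn, table, group_col, numeric_agg="AVG"):
    """Build a GROUP BY query that aggregates every numeric column."""
    # Identifiers are interpolated directly, so they must be trusted input.
    # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
    columns = conn.execute(f"PRAGMA table_info({table})").fetchall()
    numeric = [
        col[1] for col in columns
        if col[1] != group_col and col[2].upper() in ("INTEGER", "REAL")
    ]
    select = ", ".join(f"{numeric_agg}({c}) AS {c}" for c in numeric)
    return f"SELECT {group_col}, {select} FROM {table} GROUP BY {group_col}"


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (grp TEXT, a INTEGER, b REAL, note TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?, ?, ?)",
    [("x", 1, 2.0, "n1"), ("x", 3, 4.0, "n2"), ("y", 5, 6.0, "n3")],
)
sql = type_based_group_by(conn, "t", "grp")
rows = conn.execute(sql + " ORDER BY grp").fetchall()
```

The text column `note` is skipped automatically because its declared type is not numeric.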
Utility
• Drop table
– missing table handling
– cascade option
• Execute any SQL statement, e.g. DDL
• Manipulate existing queries
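"Missing table handling" for a drop operation usually means choosing between failing and silently continuing when the table is absent. A small sketch of both behaviours, using SQLite and a hypothetical helper `drop_table`, is shown below. SQLite has no CASCADE option, so that part of the slide is not illustrated.

```python
import sqlite3


def drop_table(conn, table, fail_if_missing=False):
    """Drop `table`; optionally tolerate a table that does not exist."""
    # The identifier is interpolated directly, so `table` must be trusted input.
    clause = "" if fail_if_missing else "IF EXISTS "
    conn.execute(f"DROP TABLE {clause}{table}")


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging (id INTEGER)")
drop_table(conn, "staging")  # table exists: dropped
drop_table(conn, "staging")  # already gone: silently ignored

try:
    drop_table(conn, "staging", fail_if_missing=True)
    strict_failed = False
except sqlite3.OperationalError:
    strict_failed = True  # "no such table" surfaces as an error
```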
HDFS File Handling
• New nodes
– HDFS Connection
– HDFS File Permission
• Utilize the existing remote file handling nodes
– Upload/download files
– Create/list directories
– Delete files
Workflow 1: PrepareData
~ 2 days
Irish Smart Energy Meter Trials
• July 2009 – Dec 2010
• 6000 meters
• roughly 176m rows of data
Import Data from Database into KNIME
< 30 min
Big Data.
Machine Learning on Hadoop
Machine Learning on Hadoop
• Based on Spark MLlib
• Scalable machine learning library
• Runs on Hadoop
• Algorithms for
– Classification (decision tree, naïve bayes, …)
– Regression (logistic regression, linear regression, …)
– Clustering (k-means)
– Collaborative filtering (ALS)
– Dimensionality reduction (SVD, PCA)
MLlib Integration
• Usage model and dialogs similar to existing nodes
• No coding required
MLlib Integration
• MLlib model ports for model transfer
• Native MLlib model learning and prediction
• Spark nodes start and manage Spark jobs
• Supports Spark job cancelation
Native MLlib model
MLlib Integration
• Spark RDDs as input/output format
• Data stays within your cluster
• No unnecessary data movements
• Several input/output nodes, e.g. Hive, HDFS files, …
Mass Learning – Fast Event Prediction
• Convert supported MLlib models to PMML
• Mass learning on Hadoop
• Fast event prediction based on compiled models
Mix and Match
• Combine with existing KNIME nodes
Modularize and Execute Your Own Spark Code
Spark Node Overview
So what about rocket science?
Community Contributors
Technology Partners
Distribution & Consulting Partners
Community Contributors
Community User Base
Donated by Companies
Contributions from Research Institutions
Maintained by KNIME
Academic Institutions:
• Universität Tübingen (BALL, OpenMS)
• Freie Universität Berlin (SeqAn)
• MPI Dresden (ImgLib)
• Universität Dresden (Palladin)
• ETH Zürich (OpenBIS)
• University of Dundee (OMERO)
• University of Wisconsin (ImageJ2)
• …
Commercial Contributors:
• Dymatrix Consulting Group (Uplift Nodes)
• Eli Lilly (ChemInf suite)
• Novartis (RDKit, Indigo)
• Vernalis (Proteomics)
• Cenix (REST Nodes)
• Böhringer-Ingelheim (various sponsored nodes)
• …
Bioinformatics
https://tech.knime.org/bioinformatics-and-next-generation-sequencing-extensions
OpenMS
Open-source C++ software library for liquid chromatography–mass spectrometry data management and analysis.
https://tech.knime.org/community/bioinf/openms
SeqAn
Open-source C++ library of efficient algorithms and data structures for the analysis of sequences, with a focus on biological data.
https://tech.knime.org/seqan-nodes-for-knime
NGS
Nodes and workflows used for processing next-generation sequencing results
https://tech.knime.org/community/next-generationsequencing
knime4bio
Set of custom nodes for analysing NGS data
https://code.google.com/p/knime4bio/
Image Processing
https://tech.knime.org/community/image-processing
Active Classification in Cell Assay Images
• Different modules for segmentation and feature extraction
• Active Learning
CellMiner Nodes
Plate/Image Reading
– Plate Reader, Plate Editor, Plate View
Preprocessing
– Noise Filtering, Lowpass Filter
Segmentation
– Threshold-based Segmentation, Voronoi Segmentation
Features
– Line, Histogram, Texture, RGB, Zernike Moments, Shape
Active Classification
Chemistry and Cheminformatics
https://tech.knime.org/cheminformatics-extensions
Selected Open Source extensions
Selected commercial extensions
Overview of types in KNIME
• Basic KNIME types
– string, integer, double
• KNIME core chemistry types:
– SMILES, SDF, Mol, Mol2
• Structures in these formats can be rendered in KNIME tables
Nodes for type manipulation
• Molecule Type Cast
– Casts any string as a chemical type (i.e., it tells KNIME “This is a SMILES string”)
– Useful when reading data from a CSV file or database
• Marvin MolConverter
– Provided by ChemAxon/Infocom
– Translates seamlessly between types (SMILES ↔ SDF ↔ MRV)
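A type cast in this sense changes only the label attached to a value, not the value itself. The sketch below models that with a small `Cell` dataclass and a `type_cast` function; both are invented for illustration and do no chemical validation, unlike a real converter such as Marvin MolConverter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """A table cell: a raw value plus the type the platform treats it as."""
    value: str
    type_name: str = "string"


def type_cast(cell, target_type):
    """Relabel a string cell as a chemical type without changing its value."""
    allowed = {"smiles", "sdf", "mol", "mol2"}
    if target_type not in allowed:
        raise ValueError(f"unsupported chemical type: {target_type}")
    return Cell(cell.value, target_type)


raw = Cell("CCO")  # plain string, e.g. read from a CSV file
ethanol = type_cast(raw, "smiles")  # same text, now tagged as SMILES
```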
Nodes for reading and writing files
Readers and writers provided for:
– SDF, SMILES, Mol, Mol2
Sketching chemical structures – use Marvin
MarvinSketch
• Provided by ChemAxon/Infocom
• Sketch structures in the configuration dialog
• Execute node to inject structures into workflow
RDKit
• Open-source cheminformatics library in C++
• Wrappers for KNIME maintained by the open source community
• Useful for:
– Descriptor calculation
– Cleaning structures
– InChI conversion
– Standardizing SMILES
– Fingerprints
– Scaffolds/substructures
– Reaction simulation
– and more…
Infocom JChem KNIME Nodes
Extensions of ChemAxon’s tools for KNIME workflows
Infocom implements them with the support of ChemAxon
Contains over 90% of ChemAxon's cheminformatics functionality
ChEMBL
A public database of bioactive drug-like compounds
~1.3 million compounds
~9k targets
~12 million bioactivities
Provided by the European Bioinformatics Institute
Accessible online at www.ebi.ac.uk/chembl or via EBI-provided KNIME nodes…
New Node: ChEMBLdb Connector
Access data in ChEMBL via a web service call (internet access required)
Lookup by ChEMBL ID or InChIKey
Retrieve structure and bioactivity data
Compound search using SMILES: exact, similarity, or substructure
Tool Integrations
Install KNIME
• Select the KNIME version for your computer (Mac, Win, Linux)
• Copy to your local machine
• Unpack the file in a “nice” place
Start KNIME
Go to the installation directory and launch KNIME.
The Workspace
• The workspace is the folder in which workflows (and potentially data files) for the current KNIME session are stored.
• Workspaces are portable (just like KNIME)
Starting KNIME for the first time
Install additional extensions
Goes straight to the KNIME workbench
The KNIME Workbench
A basic workflow
More on nodes…
A node can have 3 states:
Idle:
The node is not yet configured and cannot be executed with its current settings.
Configured:
The node has been set up correctly and may be executed at any time.
Executed:
The node has been successfully executed. Results may be viewed and used in downstream nodes.
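The three states form a small state machine: configuration moves a node from idle to configured, and execution moves it from configured to executed. The sketch below is a toy model of that lifecycle. The class and method names are invented, and it ignores details such as resetting a node or propagating state to downstream nodes.

```python
from enum import Enum


class NodeState(Enum):
    IDLE = "idle"
    CONFIGURED = "configured"
    EXECUTED = "executed"


class Node:
    """Toy node that tracks only its lifecycle state."""

    def __init__(self):
        self.state = NodeState.IDLE
        self.settings = None

    def configure(self, settings):
        # Storing settings makes the node executable. In this sketch,
        # reconfiguring an executed node also discards its executed status.
        self.settings = settings
        self.state = NodeState.CONFIGURED

    def execute(self):
        if self.state is NodeState.IDLE:
            raise RuntimeError("node must be configured before execution")
        self.state = NodeState.EXECUTED


node = Node()
states = [node.state]
node.configure({"column": "kwh"})
states.append(node.state)
node.execute()
states.append(node.state)
```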
Node configuration
• Most nodes require configuration
• To access a node configuration window:
• Double-click the node
• Right-click > Configure
Node execution
• Right-click node
• Select Execute in context menu
• If execution is successful, status shows green light
• If execution encounters errors, status shows red light
Node views
• Right-click node
• Select Views in context menu
• Select output port to inspect execution results
Hotkeys (for future reference)
A Peek under the Hood:
KNIME (Node) Development
Node Architecture
• KNIME interacts only with a Node
• Node takes care of embedding the node in the infrastructure
• New nodes implement Model/View/Dialog
Class diagram:
• class Node (final)
• class NodeDialogPane (abstract)
• class NodeView (abstract)
• class NodeModel (abstract)
• class NodeFactory (abstract)
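The Model/View/Dialog split can be sketched in a few lines. The example below reuses the class names from the diagram, but it is written in Python with invented method names (`execute`, `settings`, `render`, `create_*`) purely to show the shape of the design. The real KNIME API is Java and its signatures differ.

```python
from abc import ABC, abstractmethod


class NodeModel(ABC):
    @abstractmethod
    def execute(self, table, settings): ...


class NodeDialogPane(ABC):
    @abstractmethod
    def settings(self): ...


class NodeView(ABC):
    @abstractmethod
    def render(self, result): ...


class NodeFactory(ABC):
    @abstractmethod
    def create_model(self): ...

    @abstractmethod
    def create_dialog(self): ...

    @abstractmethod
    def create_view(self): ...


class Node:
    """Framework-side wrapper: the only object the platform talks to."""

    def __init__(self, factory):
        self._model = factory.create_model()
        self._dialog = factory.create_dialog()
        self._view = factory.create_view()

    def run(self, table):
        result = self._model.execute(table, self._dialog.settings())
        return self._view.render(result)


# A hypothetical node that keeps values at or above a threshold.
class ThresholdModel(NodeModel):
    def execute(self, table, settings):
        return [v for v in table if v >= settings["threshold"]]


class ThresholdDialog(NodeDialogPane):
    def settings(self):
        return {"threshold": 10}


class TextView(NodeView):
    def render(self, result):
        return ", ".join(str(v) for v in result)


class ThresholdFactory(NodeFactory):
    def create_model(self):
        return ThresholdModel()

    def create_dialog(self):
        return ThresholdDialog()

    def create_view(self):
        return TextView()


output = Node(ThresholdFactory()).run([4, 10, 25])
```

A node author supplies only the four subclasses; the wrapper that wires them together stays fixed, which is why the diagram marks `Node` as final and the others as abstract.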
Node Extension Wizard
• Included in the KNIME Developer Version
• Allows creation of plugin projects including
functioning KNIME nodes (with sample code)
• Helpful to easily create all node classes
– Generates all Java classes
– Node is registered with the plugin project
– Launch KNIME and enjoy the new node working!
• Specify all settings to create a new KNIME node
– In a completely new plugin project, or
– Into an existing project
• Node type: Sink, Source, Learner, Predictor, Manipulator, Visualizer, Meta, or Other
• Include sample code or not
• Contains all Java classes (including sample code)
• Node is registered in the plugin.xml
• NodeDialog and NodeView classes are also created and registered with the NodeFactory
Node Development
Resources
• KNIME pages (www.knime.org)
– APPLICATIONS for example workflows
– LEARNING HUB under RESOURCES (www.knime.org/learning-hub)
• KNIME Tech pages (tech.knime.org)
– FORUM for questions and answers
– DOCUMENTATION for documentation, FAQ, changelogs, ...
– LABS for new experimental nodes
– COMMUNITY CONTRIBUTIONS for development instructions and third party nodes
• KNIME TV channel on YouTube
• KNIME on Twitter: @KNIME
Knime &amp; bioinformatics

  • 1. Copyright © 2015 KNIME.com AG Биоинформатик в тридевятом царстве, или двое программистов из ларца KNIME Oleg Yasnev KNIME.com
  • 2. Copyright © 2015 KNIME.com AG 2 А вы что же за меня и код писать будете? Ага! Кадр из мультфильма «Вовка в тридевятом царстве» © «Союзмультфильм»
  • 3. Copyright © 2015 KNIME.com AG 3 KNIME.com 3
  • 4. Copyright © 2015 KNIME.com AG 4 KNIME.com • KNIME.com founded in 2008 • Offices in Zurich, San Francisco (Aug ‘13), Berlin (May ‘14) and Konstanz (October ‘15) • 15 open source releases, 10 product releases (in 2014) • >2m lines of code • 600k lines of community code 4
  • 5. Copyright © 2015 KNIME.com AG 5 Advanced Analytics Pharma Health Care Finance Retail Customer Intelligence Manu- facturing Broad Range of KNIME Application Areas 5
  • 6. Copyright © 2015 KNIME.com AG 6 The KNIME Analytics Platform 6
  • 7. Copyright © 2015 KNIME.com AG 7 From Access to Visualization and Deployment
  • 8. Copyright © 2015 KNIME.com AG 8 Data Access • Databases – MySQL, PostgreSQL – any JDBC (Oracle, DB2, MS SQL Server) • Files – Csv, txt – Excel, Word, PDF – SAS, SPSS – XML – PMML – Images, texts, networks, chem • Web, Cloud – REST, Web services – Twitter, Google
  • 9. Copyright © 2015 KNIME.com AG 9 Big Data • HDFS support • Hive • Impala • HP Vertica • In-database processing
  • 10. Copyright © 2015 KNIME.com AG 10 Transformation • Preprocessing – Row, column, matrix based • Data blending – Join, concatenate, append • Aggregation – Grouping, pivoting, binning • Feature Creation and Selection
  • 11. Copyright © 2015 KNIME.com AG 11 Analyze & Data Mining • Regression – Linear, Logistic • Classification – Decision tree, ensembles, SVM, MLP, Naïve Bayes • Clustering – k-means, DBSCAN, hierarchical • Validation – Cross-validation, scoring, ROC • Misc – PCA, MDS, item set mining • External – R, Weka
  • 12. Copyright © 2015 KNIME.com AG 12 Visualization • Interactive – Scatter plot, histogram, pie charts, box plot – Highlighting (brushing) • JFreeChart • JavaScript • Misc – Tag cloud, open street map, networks, molecules • External – R
  • 13. Copyright © 2015 KNIME.com AG 13 Deployment • Database • Files – Excel, csv, txt – XML – PMML – to: local, KNIME Server, SSH-, FTP-Server • BIRT Reporting
  • 14. Copyright © 2015 KNIME.com AG 14 Statistics Data Mining Machine Learning Web Analytics Text Mining Network Analysis Social Media Analysis WEKA R Community / 3rd MySQL, Oracle, etc. SAS, SPSS, etc. Excel, Flat, etc. Hive etc. XML, PMML Text, Doc, Image Web Crawlers Industry Specific Community / 3rd ETL Row, Column Matrix Text, Image Time Series Java Python Community / 3rd R JFreeChart Community / 3rd via BIRT PMML XML Databases Excel, Flat, etc. Hive etc. Text, Doc, Image Industry Specific Community / 3rd Over 1000 native and embedded nodes included: 14
  • 15. Copyright © 2015 KNIME.com AG 15 KNIME: Integrating Data and Tools 15
  • 16. Copyright © 2015 KNIME.com AG 16 Big Data. Pre-processing on Hadoop
  • 17. Copyright © 2015 KNIME.com AG 17 In-Database Processing 17 Loads your pre-processed data into KNIME
  • 18. Copyright © 2015 KNIME.com AG 18 Reader/Writer • Table selection • Load data into KNIME • Create table as select • Insert/append data • Delete rows from table • Update values in table 18
  • 19. Copyright © 2015 KNIME.com AG 19 Hive/Impala Loader • Upload a KNIME data table to Hive/Impala • Part of the commercial Big Data Extension 19
  • 20. Copyright © 2015 KNIME.com AG 20 Manipulation • Filter rows and columns • Join tables/queries • Sort your data • Write your own query • Aggregate your data 20
  • 21. Copyright © 2015 KNIME.com AG 21 Database GroupBy – Manual Aggregation 21
  • 22. Copyright © 2015 KNIME.com AG 22 Database GroupBy – Type Based Aggregation 22 Matches all cells Matches all numeric cells
  • 23. Copyright © 2015 KNIME.com AG 23 Utility • Drop table – missing table handling – cascade option • Execute any SQL statement e.g. DDL • Manipulate existing queries 23
  • 24. Copyright © 2015 KNIME.com AG 24 HDFS File Handling • New nodes – HDFS Connection – HDFS File Permission • Utilize the existing remote file handling nodes – Upload/download files – Create/list directories – Delete files 24
  • 25. Copyright © 2015 KNIME.com AG 25 HDFS File Handling 25
  • 26. Workflow 1: PrepareData (~2 days) • Irish Smart Energy Meter Trials • July 2009 – Dec 2010 • 6000 meters • roughly 176 million rows of data
  • 27. Import Data from Database into KNIME (< 30 min)
  • 28. Big Data. Machine Learning on Hadoop
  • 29. Machine Learning on Hadoop • Based on Spark MLlib • Scalable machine learning library • Runs on Hadoop • Algorithms for – Classification (decision tree, naïve Bayes, …) – Regression (logistic regression, linear regression, …) – Clustering (k-means) – Collaborative filtering (ALS) – Dimensionality reduction (SVD, PCA)
  • 30. MLlib Integration • Usage model and dialogs similar to existing nodes • No coding required
  • 31. MLlib Integration • MLlib model ports for model transfer • Native MLlib model learning and prediction • Spark nodes start and manage Spark jobs • Supports Spark job cancellation
  • 32. MLlib Integration • Spark RDDs as input/output format • Data stays within your cluster • No unnecessary data movements • Several input/output nodes, e.g. Hive, HDFS files, …
  • 33. Mass Learning – Fast Event Prediction • Convert supported MLlib models to PMML • Mass learning on Hadoop • Fast event prediction based on compiled models
  • 34. Mix and Match • Combine with existing KNIME nodes
  • 35. Modularize and Execute Your Own Spark Code
  • 36. Spark Node Overview
  • 37. So what about rocket science?
  • 38. Community Contributors • Technology Partners • Distribution & Consulting Partners • Community Contributors • Community User Base • Donated by Companies • Contributions from Research Institutions • Maintained by KNIME
  • 39. Community Contributors • Technology Partners • Distribution & Consulting Partners • Community Contributors • Community User Base • Academic Institutions: • Universität Tübingen (BALL, OpenMS) • Freie Universität Berlin (SeqAn) • MPI Dresden (ImgLib) • Universität Dresden (Palladin) • ETH Zürich (OpenBIS) • Dublin University (OMERO) • University of Wisconsin (ImageJ2) • … • Commercial Contributors: • Dymatrix Consulting Group (Uplift Nodes) • Eli Lilly (ChemInf suite) • Novartis (RDKit, Indigo) • Vernalis (Proteomics) • Cenix (REST Nodes) • Böhringer-Ingelheim (various sponsored nodes) • …
  • 40. Bioinformatics https://tech.knime.org/bioinformatics-and-next-generation-sequencing-extensions
  • 41. OpenMS • Open-source C++ software library for liquid chromatography–mass spectrometry data management and analysis. https://tech.knime.org/community/bioinf/openms
  • 42. SeqAn • Open-source C++ library of efficient algorithms and data structures for the analysis of sequences, with a focus on biological data. https://tech.knime.org/seqan-nodes-for-knime
  • 43. NGS • Nodes and workflows used for processing next-generation sequencing results. https://tech.knime.org/community/next-generationsequencing
  • 44. knime4bio • Set of custom nodes for analysing NGS data. https://code.google.com/p/knime4bio/
  • 45. Image Processing https://tech.knime.org/community/image-processing
  • 46. Active Classification in Cell Assay Images • Different modules for segmentation and feature extraction • Active Learning
  • 47. Active Classification in Cell Assay Images • CellMiner Nodes • Plate/Image Reading – Plate Reader, Plate Editor, Plate View • Preprocessing – Noise Filtering, Lowpass Filter • Segmentation – Threshold-based Segmentation, Voronoi Segmentation • Features – Line, Histogram, Texture, RGB, Zernike Moments, Shape • Active Classification
  • 48. Chemistry and Cheminformatics https://tech.knime.org/cheminformatics-extensions
  • 49. Selected Open Source extensions
  • 50. Selected commercial extensions
  • 51. Overview of types in KNIME • Basic KNIME types • string, integer, double • KNIME core chemistry types: • smiles, sdf, mol, mol2 • Structures in these formats can be rendered in KNIME tables
  • 52. Nodes for type manipulation • Molecule Type Cast • Casts any string as a chemical type (i.e. it tells KNIME “This is a SMILES string”) • Useful when reading data from a CSV file or database • Marvin MolConverter • Provided by ChemAxon/Infocom • Translates seamlessly between types (smiles ↔ sdf ↔ mrv)
  • 53. Nodes for reading and writing files • Readers and writers provided for: sdf, smiles, mol, mol2
  • 54. Sketching chemical structures – use Marvin • MarvinSketch • Provided by ChemAxon/Infocom • Sketch structures in the configuration dialog • Execute node to inject structures into workflow
  • 55. RDKit • Open source cheminformatics library in C++ • Wrappers for KNIME maintained by the open source community • Useful for: • Descriptor calculation • Cleaning structures • InChI conversion • Standardizing SMILES • Fingerprints • Scaffolds/substructures • Reaction simulation • and more…
  • 56. Infocom JChem KNIME Nodes • Extensions of ChemAxon’s tools for KNIME workflows • Implemented by Infocom with the support of ChemAxon • Contains over 90% of ChemAxon's cheminformatics functionality
  • 57. ChEMBL • A public database of bioactive drug-like compounds • ~1.3 million compounds • ~9k targets • ~12 million bioactivities • Provided by the European Bioinformatics Institute • Accessible online at www.ebi.ac.uk/chembl or via EBI-provided KNIME nodes…
  • 58. New Node: ChEMBLdb Connector • Access data in ChEMBL via a web service call (internet access required) • Look up by ChEMBL ID or InChI Key • Retrieve structure and bioactivity data • Compound search using SMILES: exact, similarity, or substructure
  • 59. Tool Integrations
  • 60. Install KNIME • Select the KNIME version for your computer (Mac, Win, Linux) • Copy to your local machine • Unpack the file in a “nice” place
  • 61. Start KNIME • Go to the installation directory and launch KNIME.
  • 62. The Workspace • The workspace is the folder in which workflows (and potentially data files) for the current KNIME session are stored. • Workspaces are portable (just like KNIME)
  • 63. Starting KNIME for the first time • Install additional extensions • Goes straight to the KNIME workbench
  • 64. The KNIME Workbench
  • 65. A basic workflow
  • 66. More on nodes… A node can have 3 states: • Idle: The node is not yet configured and cannot be executed with its current settings. • Configured: The node has been set up correctly and may be executed at any time. • Executed: The node has been successfully executed. Results may be viewed and used in downstream nodes.
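The three node states can be read as a small state machine: a node starts idle, becomes configured once it has valid settings, and becomes executed once it has produced results. The sketch below illustrates that reading only; `SketchNode`, its methods, and the fake result string are hypothetical and are not part of the KNIME API.

```python
from enum import Enum


class NodeState(Enum):
    IDLE = "idle"              # not yet configured, cannot be executed
    CONFIGURED = "configured"  # set up correctly, may be executed
    EXECUTED = "executed"      # results available for downstream nodes


class SketchNode:
    """Hypothetical stand-in for a workflow node; not a KNIME class."""

    def __init__(self):
        self.state = NodeState.IDLE
        self.settings = None
        self.result = None

    def configure(self, settings):
        # Assumption of this sketch: new settings discard any earlier result.
        self.settings = settings
        self.result = None
        self.state = NodeState.CONFIGURED

    def execute(self):
        if self.state is NodeState.IDLE:
            raise RuntimeError("node is idle: configure it before executing")
        if self.state is NodeState.CONFIGURED:
            self.result = f"output computed with {self.settings}"
            self.state = NodeState.EXECUTED
        return self.result


node = SketchNode()
node.configure({"column": "kwh"})
node.execute()
print(node.state)
```

Calling `execute()` on a fresh `SketchNode` raises an error, which mirrors the rule that an idle node cannot run with its current settings.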
  • 67. Node configuration • Most nodes require configuration • To access a node configuration window: • Double-click the node • Right-click > Configure
  • 68. Node execution • Right-click node • Select Execute in context menu • If execution is successful, status shows green light • If execution encounters errors, status shows red light
  • 69. Node views • Right-click node • Select Views in context menu • Select output port to inspect execution results
  • 70. Hotkeys (for future reference)
  • 71. A Peek under the Hood: KNIME (Node) Development
  • 72. Node Architecture • KNIME interacts only with a Node • Node takes care of embedding the node in the infrastructure • New nodes implement Model/View/Dialog • Classes: Node (final), NodeDialogPane (abstract), NodeView (abstract), NodeModel (abstract), NodeFactory (abstract)
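The division of labour on this slide can be sketched as follows: a framework-owned `Node` wrapper talks to a factory, and the factory supplies the model, dialog, and view that a node developer implements. This is a Python analogy under stated assumptions, not the KNIME Java API: the class names follow the slide, but every method name (`execute`, `current_settings`, `render`, `create_model`, `create_dialog`, `create_view`, `run`) and the threshold example are hypothetical.

```python
from abc import ABC, abstractmethod


class NodeModel(ABC):
    @abstractmethod
    def execute(self, rows, settings):
        """Transform the input rows using the given settings."""


class NodeDialogPane(ABC):
    @abstractmethod
    def current_settings(self):
        """Return the settings chosen by the user."""


class NodeView(ABC):
    @abstractmethod
    def render(self, rows):
        """Return a displayable summary of the model output."""


class NodeFactory(ABC):
    @abstractmethod
    def create_model(self): ...

    @abstractmethod
    def create_dialog(self): ...

    @abstractmethod
    def create_view(self): ...


class Node:
    """Framework-side wrapper: the only class the platform interacts with."""

    def __init__(self, factory):
        self._model = factory.create_model()
        self._dialog = factory.create_dialog()
        self._view = factory.create_view()

    def run(self, rows):
        settings = self._dialog.current_settings()
        return self._view.render(self._model.execute(rows, settings))


# A node developer only supplies these four small classes.
class ThresholdModel(NodeModel):
    def execute(self, rows, settings):
        return [r for r in rows if r >= settings["threshold"]]


class ThresholdDialog(NodeDialogPane):
    def current_settings(self):
        return {"threshold": 2}


class CountView(NodeView):
    def render(self, rows):
        return f"{len(rows)} rows: {rows}"


class ThresholdFactory(NodeFactory):
    def create_model(self):
        return ThresholdModel()

    def create_dialog(self):
        return ThresholdDialog()

    def create_view(self):
        return CountView()


summary = Node(ThresholdFactory()).run([1, 2, 3])
print(summary)
```

Keeping `Node` closed and the other classes abstract means the platform controls scheduling and embedding, while node authors only fill in behaviour.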
  • 73. Node Extension Wizard • Included in the KNIME Developer Version • Allows creation of plugin projects including functioning KNIME nodes (with sample code) • Helpful to easily create all node classes – Generates all Java classes – Node is registered with the plugin project – Launch KNIME and enjoy the new node working!
  • 74. Node Extension Wizard
  • 75. Node Extension Wizard • Specify all settings to create a new KNIME node – In a completely new plugin project, or – Into an existing project • Node type: Sink, Source, Learner, Predictor, Manipulator, Visualizer, Meta, or Other • Include sample code or not
  • 76. Node Extension Wizard • Contains all Java classes (including sample code) • Node is registered in the plugin.xml • NodeDialog and NodeView class are also created and registered to the NodeFactory
  • 77. Node Development
  • 78. Resources • KNIME pages (www.knime.org) • APPLICATIONS for example workflows • LEARNING HUB under RESOURCES: www.knime.org/learning-hub • KNIME Tech pages (tech.knime.org) • FORUM for questions and answers • DOCUMENTATION for documentation, FAQ, changelogs, ... • LABS for new experimental nodes • COMMUNITY CONTRIBUTIONS for development instructions and third-party nodes • KNIME TV channel on YouTube • KNIME on Twitter: @KNIME