You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: 03_workflow/034_workflow_management_concepts.html
+8-1Lines changed: 8 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -232,6 +232,7 @@ <h3 id="models">Models<a class="headerlink" href="#models" title="Permanent link
232
232
<p>In the realm of data science, several established workflow management models help guide teams through the complexities of data projects. These models are designed to ensure that every phase of a project aligns with business objectives and leverages data insights effectively.</p>
233
233
<h4id="crisp-dm_cross-industry_standard_process_for_data_mining">CRISP-DM (Cross-Industry Standard Process for Data Mining)<aclass="headerlink" href="#crisp-dm_cross-industry_standard_process_for_data_mining" title="Permanent link">#</a></h4>
234
234
<p>CRISP-DM is a widely adopted model that provides a comprehensive framework for carrying out data mining projects. It consists of six phases: business understanding, data understanding, data preparation, modeling, evaluation, and deployment. This model emphasizes a cyclical process allowing for continuous improvements based on insights gained from previous iterations.</p>
<h4id="tdsp_team_data_science_process">TDSP (Team Data Science Process)<aclass="headerlink" href="#tdsp_team_data_science_process" title="Permanent link">#</a></h4>
265
267
<p>Developed by Microsoft, TDSP structures projects into five key phases: business understanding, data acquisition and understanding, modeling, deployment, and customer acceptance. It stresses the importance of iterative learning and effective communication within data science teams.</p>
268
+
<divalign="center">
266
269
<divclass="mermaid">graph TD
267
270
268
271
subgraph TDSP
@@ -297,9 +300,11 @@ <h4 id="tdsp_team_data_science_process">TDSP (Team Data Science Process)<a class
297
300
style E fill:#aaa,stroke:#333,stroke-width:2px
298
301
style F fill:#aaa,stroke:#333,stroke-width:2px
299
302
303
+
</div>
300
304
</div>
301
305
<h4id="kdd_knowledge_discovery_in_databases">KDD (Knowledge Discovery in Databases)<aclass="headerlink" href="#kdd_knowledge_discovery_in_databases" title="Permanent link">#</a></h4>
302
306
<p>KDD is a non-linear, iterative process focusing on the discovery of actionable knowledge from large volumes of data. This process involves selection, preprocessing, transformation, data mining, and the interpretation of the discovered patterns.</p>
@@ -334,9 +339,11 @@ <h4 id="kdd_knowledge_discovery_in_databases">KDD (Knowledge Discovery in Databa
334
339
335
340
end
336
341
342
+
</div>
337
343
</div>
338
344
<h4id="guos_data_science_workflow">Guo's Data Science Workflow<aclass="headerlink" href="#guos_data_science_workflow" title="Permanent link">#</a></h4>
339
345
<p>Guo's model is particularly useful for ensuring that data science projects are reproducible and transparent. It suggests a workflow where programming and exploratory data analysis are carried out in tandem, allowing for a deeper understanding and more robust analysis.</p>
346
+
<divalign="center">
340
347
<divclass="mermaid">graph LR
341
348
342
349
subgraph GUO
@@ -408,7 +415,6 @@ <h4 id="guos_data_science_workflow">Guo's Data Science Workflow<a class="headerl
408
415
409
416
end
410
417
411
-
412
418
style A fill:#aaa,stroke:#333,stroke-width:2px
413
419
style B fill:#aaa,stroke:#333,stroke-width:2px
414
420
style C fill:#aaa,stroke:#333,stroke-width:2px
@@ -425,6 +431,7 @@ <h4 id="guos_data_science_workflow">Guo's Data Science Workflow<a class="headerl
0 commit comments