Structure Your Data Science Projects by Continuously Telling Stories

Yet, just not arranging an exploration project additionally conveys a great deal of hazard. I've seen numerous a PhD competitor do a very long time of exploration with nothing truly to show for it, causing a ton of nervousness for the up-and-comer. So presently we are left with a problem: how to design the difficult to design? In this article I need to introduce a few thoughts that are focused on constantly advising yourself and your colleagues the story or account of your undertaking: for what reason are we doing this and what progress have we made. These thoughts were brought into the world from my own insight, and from attempting to assist my understudies with doing their activities. This is in no way, shape or form the most important thing in the world of how to run an information science project, yet I trust the thoughts will demonstrate significant to you.


Reflection is king
Probably the greatest danger I find in an information science project is that you keep doing explore without considering your advancement. You accomplish loads of work, yet not really tackling the issue. What makes this sort of reflection hard is simply the way that you need to lift yourself up from the bare essential of your work, and take a helicopter see. How do my outcomes progress me towards my definitive objective? 

What truly helps driving a helicopter see is utilizing a strategy like CRISP-DM. This model cuts up the information science work process into various stages, going from evaluating your business issue and your information, to performing examinations and considering what they mean for the business issue. The objective isn't to simply perform factual investigations, yet to make a story that completely responds to the business question. The coordinated progression of CRISP-DM reminds us to reflect frequently.
Embrace iterations
Via Agile methods such as Scrum, the IT world has embraced iterative working. In research this is just as, if not more important. Our project does not consist of one big reflection (CRISP-DM) cycle, but dozens of cycles that craft dozens of narratives. These smaller narratives form the backbone of the larger overall story.
Continuously refine and merge
Using reflection, narratives and iteration we can structure our data science project. In my experience, at the beginning of the project we start with a very basic understanding of the business problem. For example: how can we detect anomalies in x-ray images. From this basic premise we craft our first few starting narratives. One simple narrative could be that we ask the hospital for two sets of x-ray images, one that does and one that does not contain anomalies. We simply look with our eyes to see if we can tell the two sets of images apart. From that visual inspection we craft a narrative. In the evaluation part of the narrative, ideas for statistical tools to use can emerge as new prospective narratives.
As the project moves along, the narratives become more and more intricate and build on top of each other. In addition, some narratives are deemed dead ends and are discarded. If we write a research paper at the end, this presents the final overall narrative that wraps and summarises all the previous smaller narratives. The later intricate narratives often influence the final paper more strongly.
Planning narratives
But how do we plan the flow of all these interlocking narratives? We can try and borrow some concepts from Scrum. We can use a planning board to track our narratives, with the prospective narratives listed in the backlog. In the sprint review we can present and spar about the narratives, further honing their storylines. In addition, we can use the sprint planning to decide which of the narratives on the backlog presents the best way forward. Finally, in the Sprint Retrospective we discuss as a team how we can collaborate more effectively. Other staples such as fixed sprint length or using story points for narratives might not work very well, but we can still create a structured way of working in the hard to plan world of data science research.

Post a Comment

0 Comments