Data management part 1
Last week I joined the big data London event (online, of course) I have to say that it was very inspiring! Lots of different speakers from all sort of companies and organizations. Very practical information and a lot of real life examples what’s being done with data science, machine learning and data analytics. It really inspired me and made me feel so energetic. I truly want to continue on this road that I’m on! It was a strong confirmation that I’m going in the right direction and doing exactly what I like and where I want to grow in.
I was positively surprised by the amount of choice of presentations. I joined multiple presentations and a very interesting discussion about data in general. Clouds, data lakes and the amount of data that is available now-a-days. The (best) way to threat this data and how to deal with how people using this data. Second, the amounts of data and how to manage, organize and/or structure that. Even heads of data analytics and/or heads of data scientists where speaking about those issues and having a discussion how to deal with that and what they think the future will bring us. Or course they’re thinking about multi million dollar solutions for the next 10 years, but for smaller business you basically run into some of the same problems, on a different scale.
Organise your data
Smaller businesses may have their data (excel sheets, csv’s, invoices) stored on an external hard drive, or a laptop, or a cloud, or something else. In short, your data is stored somewhere. When you’ve started small, it’s just some files saved and you have some systems collecting data, but at some point when you grow, you focus on your work and the money coming in doing that. Actively think about what data you collect, where you store it and what you can do with it, is probably not high on your priority list. Before you know it, you build your own data maze. A lot of this data is actually very useful! But because there is so much, you don’t really know what, or, it’s all stored in 5 different files, you missing the general overview or don’t really know how to connect this to let it make sense. The good news is that you’re not the only one. Even a multi million dollar company experiences the same. They spend big money on clouds, and storing data in data lakes but it all ends up in the same mess. But that’s not really a solution for you. Some good news is that you can get useful insights from it in a more affordable way!
What can you do?
Data from different systems can be uploaded to a database, from where this data can be accessed and analyzed (for example via SQL and/or python) to gain useful insights. At this stage it’s important to find out, what data you have, and what potentially can be done with it. Second but also important stage can be what data you would like to have, but what you’re missing out on at the moment. Doing it this way, you’re not only setting your data up just for an analysis once, but it provide long term insights, because data is collected every day, week or month and can be forwarded to your data base so you keep adding data to it, which can help you gaining insights longterm.
I’m working on a part two, to go a bit more in-depth. If you have any questions, wanna talk about it or want me to help you structure and organize your data, feel free to contact me and we’ll find out what’s possible for you.