We have said goodbye to a year that was very good for us in terms of the company's growth. To mark the occasion, we have prepared a summary of what we achieved and of what makes us believe that 2020 will be even better. The year 2019 in numbers Headcount growth from 18 to 43 which yields
Optimizing the production process is a priority for every manufacturer. The world has already witnessed three industrial revolutions; each of them redefined how goods are made, and each established new market leaders. Now the fourth revolution, called Industry 4.0, is happening in front of us.
In this article, I will present the concept of data vectorization using the NumPy library. We will benchmark several approaches to computing Euclidean distance efficiently. NumPy is a Python library for manipulating multidimensional arrays in a very efficient way. It is also a base for scientific libraries (like pandas or SciPy) that are commonly used
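As a rough illustration of what vectorization means here, the sketch below computes the same Euclidean distance with a plain Python loop and with NumPy array operations. The function names (`euclidean_loop`, `euclidean_vectorized`, `pairwise_distances`) are hypothetical and chosen for this example only; they are not taken from the article, and they are not necessarily the approaches it benchmarks.

```python
import numpy as np


def euclidean_loop(a, b):
    """Pure-Python baseline: accumulate squared differences one element at a time."""
    total = 0.0
    for x, y in zip(a, b):
        total += (x - y) ** 2
    return total ** 0.5


def euclidean_vectorized(a, b):
    """Vectorized version: subtraction, squaring and summing run on whole arrays."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.sqrt(np.sum((a - b) ** 2)))


def pairwise_distances(points, query):
    """Distance from every row of `points` to `query`, using broadcasting."""
    points = np.asarray(points, dtype=float)
    query = np.asarray(query, dtype=float)
    # `points - query` broadcasts `query` across all rows; the norm is taken per row.
    return np.linalg.norm(points - query, axis=1)
```

The loop and the vectorized function should agree on the result; the difference is that the vectorized version pushes the per-element work into NumPy's compiled routines, which is where a speed-up on large arrays would be expected to come from.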
Summarization is useful whenever you need to condense a large number of documents into shorter texts. Anyone who has browsed scientific papers knows the value of abstracts; unfortunately, documents in general don't share this structure. This article is an overview of some text summarization methods in Python.
At Semantive, we have recently started to use serverless principles quite extensively in our projects, and re:Invent proved to be a perfect place to learn something new and see how others deal with problems related to this technology. Serverless, along with deep learning, was one of the most prominent topics highlighted during sessions and keynotes.
Earlier this year, like most undergraduate students before the holidays, I was wondering what I should focus on with regard to my career development. I decided to try data science for a few reasons – it is not only one of the hottest, best-paid professions right now, but also allows the development of two areas I
AWS re:Invent is the biggest cloud computing community event in the world. Last week, Las Vegas was the place where AWS brought together over 50k professionals eager to learn, network and share their experiences. Four of us from Semantive had the immense pleasure of being part of the event and want to share some
Running another asynchronous process from within Step Functions is a non-trivial task. Traditional approaches have many shortcomings that may make them infeasible in a production environment. The Activity Task can be used to solve this issue and avoid most of the common problems. This is achieved by using a pattern similar to the Correlation Identifier. In the first part
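A minimal sketch of how such a pattern might look, assuming a worker that polls an activity and hands the task token to the external process as its correlation identifier. The helper names (`dispatch_next_task`, `report_job_success`), the `start_async_job` callback and the worker name are hypothetical; `sfn_client` is assumed to be a boto3 Step Functions client, whose `get_activity_task` and `send_task_success` calls are the only AWS operations used. This is not the implementation from the article.

```python
import json


def dispatch_next_task(sfn_client, activity_arn, start_async_job,
                       worker_name="example-worker"):
    """Poll the activity once and hand the work to an external async process.

    The task token returned by Step Functions plays the role of the
    correlation identifier: it travels with the job and is sent back later.
    """
    response = sfn_client.get_activity_task(
        activityArn=activity_arn, workerName=worker_name
    )
    task_token = response.get("taskToken")
    if not task_token:
        # The long poll ended without a scheduled task.
        return None
    payload = json.loads(response.get("input") or "{}")
    # Hypothetical callback: starts the external process and stores the token with it.
    start_async_job(payload, task_token)
    return task_token


def report_job_success(sfn_client, task_token, result):
    """Called when the external process finishes; resumes the waiting execution."""
    sfn_client.send_task_success(taskToken=task_token, output=json.dumps(result))
```

With this shape, the state machine execution stays paused on the activity state until something calls `report_job_success` with the matching token, so the external process needs no knowledge of the state machine beyond the token it was given.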
Today, we share our experience from a Kaggle competition where we needed to detect ships in satellite images. We faced numerous challenges along the way and worked out effective strategies for solving them. Make sure to understand the problem When starting a new project, it is always important not to jump into