40 Zettabytes is a huge figure, which in 2020 will grow to around 6 to 7 terabytes (6 000 000 000 000 bytes) for each and every inhabitant on earth. And this number doubles every 9 months, which means that over the next 10 years it will be multiplied by over 8 000 times.
Even more important is the fact that 75% of this information is destructured and therefore, as we have seen, is very difficult to use. This is why all the big brands, such as Amazon, Google, Facebook, IBM, Microsoft and Apple, to name but a few, are working on big-data, data mining and artificial intelligence, as well as many START-UPS across the world.
All these major players are in this field because, as we said, the possibility of using, handling and enriching these data represents a crucial challenge for business. Any unused data is sleeping money. Restructuring breathes new life into these data and makes them as valuable as they were when they were created.
There is a huge amount of sleeping data inside our computers and storage units, including:
- The data in certain databases, which, when combined with information from other databases, represents a great wealth. In this case, the data is semi-structured
- The data present on the Internet are also semi-structured
- Photos, images, videos, audio tracks are destructured data
- PDF files, which we’ve already mentioned, and which are present throughout the office software universe
- Sensors and the IoT which record huge quantities of measurements
Our aim is to restructure graphic documents which combine text and drawings produced by software and stored in various vector formats:
- Building plans
- Hydraulic, pneumatic, electric diagrams
- Automation diagrams
- Industrial process charts
- Highway plans
- 2D mechanical diagrams
- Assembly and dismantling instructions
- Machining sequences
- Methodological sheets