Tags » Bigdata

Optimized and automatized biomarker discovery and processing of biomedical data with InSyBio Pipelines

The new version of InSyBio Suite includes InSyBio Pipelines, a new tool for automating and optimizing the computational process of identifying biomarkers.

  • The pioneering InSyBio Pipelines tool runs the full analysis in one step, from raw data to the final biomarkers and predictive models, which simplifies the process and lets people who are not machine learning or bioinformatics experts run it.

Build a Streaming Data Pipeline on GCP using Mac OS

Hello World! Today I wanted to share my experience on how I set up a Streaming Data pipeline on GCP using my Mac. If you use a Mac and are starting off with GCP, this blog will walk you through the resources you need to create a…


The Modern Rules Of Data Festival

After the storm

The Moat is all messy from the rainstorm. The flood washed in a lot of stuff; suffice it to say, it’s not clean. It is going to take a lot to clean this up in preparation for the final day of the Īgue festival.


Migrating billion+ documents in Elastic Search to AWS S3 !!!

Enterprises today need to monetize their data or, failing that, use the insights from their enormous data volumes to generate new income from new products. Building the infrastructure and platforms to gather, clean, prepare, process, and standardize big data, and to make it available to heterogeneous stakeholders in a vast, constantly changing ecosystem, is a daunting task.


Moving data from Oracle to HDFS

I divided the scope into two stages:

Stage 1 | Initial data load from Oracle database to HDFS (bulk load)

  • Here I used Oracle's native utility (Copy2hadoop) to reduce the friction…