What is a data lake?

A data lake is a centralized repository that stores all of your organization's data in its native format. This means that data can be stored in its raw, unprocessed form, including structured, semi-structured, and unstructured data. This makes data lakes highly scalable and flexible, as they can store any type or volume of data.

Data lakes offer a number of benefits, including:

  • Improved data accessibility: Data lakes make it easier for organizations to access and use their data, because the data is stored in a way that is easy to query and analyze.
  • Enhanced analytics capabilities: Data lakes can support a wide range of analytics applications, including machine learning, artificial intelligence, and big data analytics.
  • Accelerated innovation: Data lakes can help organizations innovate faster by providing a centralized repository of data that can be easily accessed and analyzed.

How data lakes work

Data lakes typically consist of three main components:

  • Data ingestion: Data is ingested into the data lake from a variety of sources, such as databases and applications.
  • Data storage: Data is stored in the data lake in its native format. This means that data is not processed or transformed before it is stored.
  • Data processing: Data is processed in the data lake to prepare it for analysis. This may include cleaning, transforming, and enriching the data.

In our architecture, Learn Amp data flows from our database into a series of S3 buckets. We then process and structure the data into a curated zone (an S3 bucket) and into a structured data warehouse, so that it can be connected to our BI tool.

Data lakes vs. data warehouses

Data lakes are often compared to data warehouses. Data warehouses are also centralized repositories of data, but they store data in a structured format that is optimized for reporting and analysis. Data lakes, on the other hand, can store any type of data, including unstructured data. This makes data lakes more flexible than data warehouses.

Note: Your data is stored in a curated, isolated and structured form within the Learn Amp data lake. Our data pipelines process a subset of your most meaningful data. We make this subset available to you in our data lake.
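To make the ingestion, storage, and processing stages described under "How data lakes work" more concrete, here is a minimal Python sketch. It is an illustration only, not Learn Amp's actual pipeline: the two dictionaries are in-memory stand-ins for a raw zone and a curated zone (S3 buckets in the real architecture), and the key names and record fields (`user_id`, `course`, `completed`) are hypothetical.

```python
import json

# Hypothetical in-memory stand-ins for the raw and curated zones.
# In the architecture described above these would be S3 buckets.
raw_zone: dict[str, str] = {}
curated_zone: dict[str, list[dict]] = {}


def ingest(key: str, payload: str) -> None:
    """Ingestion and storage: keep the payload exactly as received."""
    raw_zone[key] = payload


def process(raw_key: str, curated_key: str) -> list[dict]:
    """Processing: clean and structure raw records into the curated zone."""
    rows = []
    for line in raw_zone[raw_key].splitlines():
        line = line.strip()
        if not line:
            continue  # cleaning: skip blank lines
        record = json.loads(line)
        if record.get("user_id") is None:
            continue  # cleaning: drop records without a required field
        rows.append({
            # transforming: normalize types and text formatting
            "user_id": int(record["user_id"]),
            "course": str(record.get("course", "")).strip().lower(),
            "completed": bool(record.get("completed", False)),
        })
    curated_zone[curated_key] = rows
    return rows


# Example: a small newline-delimited JSON export with messy records.
raw_export = "\n".join([
    '{"user_id": "7", "course": "  Onboarding ", "completed": true}',
    '{"user_id": null, "course": "Security"}',
    "",
    '{"user_id": 12, "course": "Security"}',
])
ingest("raw/completions.jsonl", raw_export)
curated = process("raw/completions.jsonl", "curated/completions")
```

The raw zone keeps the export untouched, so it can be reprocessed later with different rules, while the curated zone holds only cleaned, consistently typed rows that a data warehouse or BI tool could consume.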