Can Amazon Web Services (AWS) be used for huge databases? -
Can Amazon Web Services (AWS) be used for huge databases? -
i have project requires storage of several dozens of tbs
of data. @ first, few tbs
of info stored. then, info expected grow @ pace of 5 gb
per day.
based on business logic, able split across 10,000s
of files / tables (each file / table consist of info of size ~ 1 [gb]
).
(note: @ given point in time, 10-20 [gb]
(say, 20 tables) considered hot data
i'll actively query against.)
can amazon web services (aws) used store & query such huge amount of data?
if instead i'd store info in-house, best way approach (for instance, clear cannot have of info on 1 machine etc...)?
lacking info info format, schema, etc, here's how i'd handle it:
store info in s3 in map/reduce friendly format use hadoop (or elastic map reduce) query entire dataset when needed. typical pattern utilize m/r answers need ahead of time , store much smaller results in other database. if there's need have 20gb of 'hot data' in sql, i'd have etl loading amazon rds (mysql), i'd seek avoid if utilize m/r reply questions , have little , optimized info set. database amazon-web-services
Comments
Post a Comment