Skip to main content

BIG DATA

OBJECTIVES : 

-> What is big data?
-> How to handle big data?
->Solutions for big data


BIG DATA:
Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, and information privacy.

                  Big data is a massive volume of data that refers to both structured and unstructured and difficult to process.It has mainly three aspects.They are:
             
                            1)velocity
                            2)volume
                            3)variety
velocity: size of the data.Big data is so massive that it gets accumulated at large amounts in very less amount of time.

volume: Size of the data is very large that it can't be processed using conventional methods.

 variety: Big data deals with both structured and unstructured data.



How to handle big data:

                                 there are many solutions provided to handle big data efficiently over many years.
 Companies like yahoo,facebook, google and many more people all around the world have worked to provide solutions to this problem.Some of the solutions avalable for big data are hadoop,pig,hive,spark.

Solutions for big data:

                    Hadoop is a open source project developed by Apache with the white paper algorith released by google. Google released a algorithm which explains how the search gaint handles large amounts of data.
Code is developed for same by the developers of yahoo and many people all over the world.

Hadoop uses map reduce function and HDFS for processing big data.  




 




Comments

Popular posts from this blog

Active Directory in Windows

                                                                Active Directory in Windows  When a windows server is bought it comes with many capabilities like AD groups for security, IIS(Internet Information Services), ability to built own VPN and many more of which Active directories are created with a domain and the computers which access the network will be connecting to the domain controllers. Practically, there will be a cluster of domain controllers where data will be replicated all over the cluster.

Internet.org

                                                        Internet.org Internet.org is a non-profitable oraganisation founded by facebook and six other companies to bring affordable access to internet services to less developed countries by increasing efficiency, and facilitating the development of new business models around the provision of Internet access. It has been criticized for violating net neutrality and favouring facebook's own services. An Indian journalist criticized it as" being just a Facebook proxy targeting India's poor" as it provides restricted Internet access to Reliance Telecom's subscribers in India.However, in early May 2015, Facebook announced that this platform would be opened to websites that met its criteria.           ...

Tail Log Backups

Tail log backups capture the tail of the log even if the database is offline, damaged, or missing data files. This might cause incomplete metadata from the restore information commands and msdb. However, only the metadata is incomplete; the captured log is complete and usable. If a tail-log backup has incomplete metadata, in the backupset table, has_incomplete_metadata is set to 1. Also, in the output of RESTORE HEADERONLY, HasIncompleteMetadata is set to 1. To create a tail log backup, use below script BACKUP LOG [DB Name]    TO [TLog_Device_Location]      WITH NO_TRUNCATE;   GO Scenarios That Require a Tail-Log Backup If the database is online and you plan to perform a restore operation on the database, begin by backing up the tail of the log. To avoid an error for an online database, you must use the ... WITH NORECOVERY option of the BACKUP Transact-SQL statement. If a database is offline and fails to start and you need to res...