In this article we will create a Apache Zookeeper cluster on three machines, we have three Ubuntu machines with 1 GB ram.
Apache Kafka Multi Broker cluster setup on Ubuntu, In this article we will create a Apache Kafka multi broker cluster on three machines, we have three ubuntu machine with 4GB ram.
In previous article we have seen, how to write a "Kafka producer in java". In this article we will see how to write a kafka consumer in java with automatic offset committing to get data from kafka cluster.
In previous article we have seen, how to write a "Kafka producer in java". In this article we will see how to write a kafka consumer in java with manual offset committing to get data from kafka cluster.
In previous article we have seen, how to write a "Kafka producer in java". In this article we will see how to write a kafka consumer in java that can be assigned manually to a specific partition.
Apache Spark Multi-Node cluster can be setup using cluster managers like Hadoop YARN, Apache Mesos or Standalone spark cluster manager. In this article we will see how to setup Apache Spark cluster on ubuntu machines using Simple standalone spark cluster manager.
In this article we will see "How to create Spark Java Application and Submit it to Spark Cluster" and submit it to spark cluster to be executed. We will create a maven Java application with Spark Java API.
In this article we will see, what is Spark SQL, SQLContext and SparkSession, how to create SQLContext and SparkSession in Spark and their implementation.
In this article we will see what are DataFrames in Apache Spark, how to create them and their operations using Spark Java API. We will also look into how to create a DataFrame from different sources like RDD, Java Lits, JSON and MySql etc.
Apache Redis is an open source in memory cache store, used as a database cache and message broker. Redis is rich in data structures like strings, hashes, lists, sets, sorted sets with range queries, bitmaps, hyperlog logs and geospatial indexes with radius queries. Apart from that Redis provides built-in replication, Lua scripting, LRU eviction, transactions and different levels of on-disk persistence, and provides high availability via Redis Sentinel and automatic partitioning with Redis Cluster.
Redis cluster provides automatic sharding or data replication on multiple nodes and some degree of fault tolerance, some degree means when majority of nodes are up and running cluster tends to work.
Redis cluster uses two ports client port i.e. 7000 and one internal communication port that is always (the client port + 10000) and fixed, so make sure there two ports are always open and do the needful in server security and firewall if needed.
In order to maintain fault tolerance, we must have to adopt master and slave model meaning that a master node should have 1 to N complete replicas; more replicas more fault tolerance.
In previous articles we have already seen, how to setup Redis Standalone Server in Ubuntu. In this particular article we will setup Spring Data with Standalone Redis Using Jedis Client.
In this article, we will see, how one can save memory on Redis while storing data in form of byte buffer. The Java application will get and set key-values on redis using byte arrays, while converting byte array to/from byte buffer.
In this blog we will dive into Introduction and features of solr4.0 and will came to know how solr is a useful search server for full text searching. Solr is a simple configuration based implementation of Full Text Searching over lucene libraries.
In this blog i will tell you how to integrate Solr4.0 with apache-tomcat 7 in linux environment. Install Tomcat on your machine and and make sure it is ready to start.
In this particular blog we will come across a very useful feature of solr4 that is highlighting the search keywords in search data. In solr4 highlighting part cab be configured in request url as well as solrconfig.xml.
In this article we will go through Importing and indexing My Sql database table data in solr4 using Data Import Handler. Now its time to provide solr4 some data on which the search will be done.
From our previous discussion we came to know that bigdata is a term used for describing rapid growth and maintainability of both structured and unstructured data.
In this blog we will get into an brief introduction of Big Data, we will come to know about the factors and statics about Big Data. Lets start with some real time examples and scenes around all of us.
In this particular blog i will provide you a short introduction on cloud computing. Today where IT is spreading very rapidly and application development is at its best, cloud computing has become a useful aspect of computing world.
Groovy is known as Java scripting language and a lot of groovy users are taking advantage of its flexible nature. Groovy is better known as a new age advanced substitute of Java, or we can say it a better Java.
In this article we will see, what is Neo4J and graph db and how to use Neo4J with Java, adding nodes, relationships, properties and much more, stay tight and go forward.
We will discuss how to create a database in MongoDB, how to create a table in MongoDb, how to insert data in a MongoDB Collection and how to update and delete data from MongoDB table(Collection)