<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>chd Archives - Albert Nogués</title>
	<atom:link href="https://www.albertnogues.com/tag/chd/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.albertnogues.com/tag/chd/</link>
	<description>Data and Cloud Freelancer</description>
	<lastBuildDate>Fri, 11 Dec 2020 12:31:30 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://www.albertnogues.com/wp-content/uploads/2020/12/cropped-cropped-AlbertLogo2-32x32.png</url>
	<title>chd Archives - Albert Nogués</title>
	<link>https://www.albertnogues.com/tag/chd/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Cloudera Manager installation in Google Cloud</title>
		<link>https://www.albertnogues.com/cloudera-manager-installation-in-google-cloud/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=cloudera-manager-installation-in-google-cloud</link>
		
		<dc:creator><![CDATA[Albert]]></dc:creator>
		<pubDate>Mon, 04 Dec 2017 21:25:39 +0000</pubDate>
				<category><![CDATA[BigData]]></category>
		<category><![CDATA[Blog]]></category>
		<category><![CDATA[chd]]></category>
		<category><![CDATA[cloudera]]></category>
		<category><![CDATA[cloudera manager]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[manager]]></category>
		<guid isPermaLink="false">http://192.168.1.40/?p=273</guid>

					<description><![CDATA[<p>Installation of Cloudera Manager and small CDH Cluster Lab in Google Cloud As a preparation for the CCA Administration certification we need a workable cluster to do our practice tests. I&#8217;m going to start showing you how to install a CDH cluster in the google cloud. To start with, we need to proceed with the &#8230; </p>
<p>The post <a href="https://www.albertnogues.com/cloudera-manager-installation-in-google-cloud/">Cloudera Manager installation in Google Cloud</a> appeared first on <a href="https://www.albertnogues.com">Albert Nogués</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Installation of Cloudera Manager and small CDH Cluster Lab in Google Cloud</p>
<p>As a preparation for the <a href="https://www.cloudera.com/more/training/certification/cca-admin.html">CCA Administration certification</a> we need a workable cluster to do our practice tests.</p>
<p>I&#8217;m going to start showing you how to install a CDH cluster in the google cloud.</p>
<p>To start with, we need to proceed with the cloudera manager installation, which will guide us to the process of setting up our small cluster, to work as a test lab. The cluster will consist in 2 master nodes (we will enable hdfs HA and yarn HA) and 3 workers, as by default hdfs replication is 3 and we will stick to that. We will use one of the workers as a gateway too, to simplify the number of machines we use.</p>
<p>Apart of these 5 machines, we will have the cloudera manager in a separate machine, so in total, we will need 6 machines. The CPU and RAM required for each machine will vary too, but to make it easier, we will use only two types of instances. Let&#8217;s start with the cloudera manager first, in this post and in the following one we will set up the cluster.</p>
<p><strong>Cloudera Manager Installation</strong></p>
<p>Open your google cloud control panel, go to the compute engine section, VM Instances and click on create. A new tab will open with some default values that we need to edit.</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM1.png"><img fetchpriority="high" decoding="async" class="alignnone size-large wp-image-274" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM1-1024x876.png" alt="" width="980" height="838" srcset="https://www.albertnogues.com/wp-content/uploads/2017/12/CM1-1024x876.png 1024w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM1-300x257.png 300w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM1-768x657.png 768w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM1-70x60.png 70w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM1.png 1038w" sizes="(max-width: 980px) 100vw, 980px" /></a></p>
<p>I will name the instance Cloudera Manager to simplify things. Choose the appropriate zone based in your geographic location, and as a machine type we will select the n1-highmem-2 which contains 2 virtual cpus and 13 GB of ram, as using the standard n1-standard-2 instance is risky as cloudera manager may not boot because it may not have enough RAM. It requires approximately 8 GB of ram, and having less, causes unexpected errors. SO we are going to play safe.</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM2.png"><img decoding="async" class="alignnone wp-image-275" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM2.png" alt="" width="402" height="595" srcset="https://www.albertnogues.com/wp-content/uploads/2017/12/CM2.png 451w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM2-203x300.png 203w, https://www.albertnogues.com/wp-content/uploads/2017/12/CM2-41x60.png 41w" sizes="(max-width: 402px) 100vw, 402px" /></a></p>
<p>As a boot disk, we are going to change the default debian for a centos7. Since this will only host the cloudera manager installation,, the disk will only hold that and the OS Files. Just in case we will increase the standard 10 GB of disk to 20 GB to be on the safer side. Leave all the other options by default, even that later we will need to work on the Firewall, to open the cloudera manager port. Once finished, click on CREATE.</p>
<p>After a couple of seconds, we will have our instance created. It will show the private and public IP. As obvious, from the outside, we will need the Public IP to connect through SSH.</p>
<p>Before connecting through ssh, we need to generate a key pair. Follow the google cloud guide for this purpose <a href="https://cloud.google.com/compute/docs/instances/connecting-to-instance#generatesshkeypair">here. </a></p>
<p>Once generated the keys, we can proceed to connect to our machine, we open our ssh client or the one integrated in google cloud (using the web browser) and we will be in:</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM3.png"><img decoding="async" class="alignnone size-full wp-image-276" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM3.png" alt="" width="602" height="74"></a></p>
<p>Before installing cloudera manager, let&#8217;s do a few configuration tasks that we may require later. These are to disable selinux and update the system.</p>
<p>To disable selinux, simply do a <em>sudo vi /etc/sysconfig/selinux </em>and set<em> selinux=disabled </em>and <strong>rebot the machine.</strong></p>
<p>then:</p>
<p><strong>sudo yum update &amp;&amp; sudo yum install wget</strong></p>
<p>After installing wget, we can proceed to download the cloudera manager installer:</p>
<p><strong>wget http://archive.cloudera.com/cm5/installer/latest/cloudera-manager-installer.bin</strong></p>
<p><strong>chmod u+x cloudera-manager-installer.bin</strong></p>
<p><strong>sudo ./cloudera-manager-installer.bin</strong></p>
<p>And the installer will start. Then we only need to follow the wizard through the setup process</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM4.png"><img loading="lazy" decoding="async" class="alignnone size-full wp-image-277" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM4.png" alt="" width="642" height="386"></a></p>
<p>and the download of the java sdk and the cloudera manager will start. The last step, will show us the url of the cloudera manager but note that, first, the url is give using the machine name and with that url we wont be able to get to the server and second, we have to open the port in the firewall so it becomes accessible from the outside.</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM5.png"><img loading="lazy" decoding="async" class="alignnone size-full wp-image-278" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM5.png" alt="" width="643" height="386"></a></p>
<p>So, now, we wait for a couple of seconds and we go back to our google cloud dashboard, click on VPC Network &gt; Firewall Rules, as seen on the image.</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM6.png"><img loading="lazy" decoding="async" class="alignnone wp-image-279" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM6.png" alt="" width="355" height="554"></a></p>
<p>And now, click on create firewall rule, name it cm7180. In the targets select either your instance or all instances in the network (not recommended), and in source ip ranges input either your ip if you connect always from the same place or 0.0.0.0/0 or any other subnet provided by your ISP and on the protocols and ports, select &#8220;specified protocols and ports&#8221; and write <strong>tcp:7180</strong></p>
<p>Of you will still have issues connecting, you can try to disable firewalld which may be blocking the access. At this point, you can either disable it completely, or configure it to give access to the port 7180.</p>
<p>After that, point your browser to <strong>http://&lt;your public ip&gt;:7180</strong> and you should see the cloudera manager login screen. The username and password to log in are <strong>admin/admin</strong> and these should be changed ASAP.</p>
<p><a href="https://www.albertnogues.com/wp-content/uploads/2017/12/CM7.png"><img loading="lazy" decoding="async" class="alignnone size-large wp-image-280" src="https://www.albertnogues.com/wp-content/uploads/2017/12/CM7-1024x600.png" alt="" width="980" height="574"></a></p>
<p>The screen shows it in spanish because cloudera manager inherits your browser locale configuration. When we login, we can finish the installation of cloudera manager, which basically consists in accepting the license terms, and choosing the license from: Cloudera Manager Express, Cloudera Enterpise 60 day trial or Cloudera Enterprise with license.</p>
<p>At this point we will use the 60 day trial, as we want to perform some tasks in the cluster that are not possible with the express version.</p>
<p>And here it finishes the installation of cloudera manager. In a next post we will cover the installation of CDH which is just the wizard that follows the license selection in cloudera manager.</p>
<p>Stay tuned for next chapter soon!</p>
<p>The post <a href="https://www.albertnogues.com/cloudera-manager-installation-in-google-cloud/">Cloudera Manager installation in Google Cloud</a> appeared first on <a href="https://www.albertnogues.com">Albert Nogués</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
