<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Lone Sysadmin &#187; System Administration</title>
	<atom:link href="http://lonesysadmin.net/category/system-administration/feed/" rel="self" type="application/rss+xml" />
	<link>http://lonesysadmin.net</link>
	<description>Virtualization, System Administration, and Technology.</description>
	<lastBuildDate>Sun, 29 Aug 2010 21:54:07 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Three Organizational Decisions That Help Me Virtualize</title>
		<link>http://lonesysadmin.net/2010/08/06/three-organizational-decisions-that-help-me-virtualize/</link>
		<comments>http://lonesysadmin.net/2010/08/06/three-organizational-decisions-that-help-me-virtualize/#comments</comments>
		<pubDate>Fri, 06 Aug 2010 18:55:55 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[General Rambling]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2762</guid>
		<description><![CDATA[Over the last ten years my organization has come a long way with its IT policies and processes. We&#8217;ve gone from the wild, wild west of IT where personal heroism ruled the day, to a place where there&#8217;s just enough process to make sure that communication happens correctly and things like our Configuration Management Database [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/04/07/how-much-capacity-should-i-have-for-vmware/" rel="bookmark">How Much Capacity Should I Have For VMware?</a><!-- (33.266)--></li>
		<li><a href="http://lonesysadmin.net/2008/08/13/why-this-vmware-time-bomb-issue-is-a-big-deal/" rel="bookmark">Why This VMware Time Bomb Issue is a Big Deal</a><!-- (32.889)--></li>
		<li><a href="http://lonesysadmin.net/2008/06/17/leopard-on-esx-would-be-nice/" rel="bookmark">Leopard on ESX Would Be Nice</a><!-- (30.8466)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>Over the last ten years my organization has come a long way with its IT policies and processes. We&#8217;ve gone from the wild, wild west of IT where personal heroism ruled the day, to a place where there&#8217;s just enough process to make sure that communication happens correctly and things like our Configuration Management Database (CMDB) stay up to date. It&#8217;s been a lot of work, but I am actually really proud of where we&#8217;re at.</p>
<p>There are three fundamental decisions we made a long time ago that, had they not been made, would have drastically changed how virtualization has proceeded here.</p>
<p><strong>1. Clearly defined maintenance windows.</strong></p>
<p>Knowing exactly when someone can do maintenance on server has been crucial to getting things done in our virtualization environment. There are many adjustments you can &amp; should make in virtual environments, but if you can&#8217;t ever take the VMs down to make the changes you&#8217;re stuck. We&#8217;ve been able to do physical to virtual migrations, performance tuning, VMware Tools upgrades, vSphere upgrades, and a whole slew of other things in relatively short timeframes because we have this all worked out already. This also lets us &#8220;right-size&#8221; our VMs &#8212; rather than deploying huge VMs just in case they need the CPU or RAM, we deploy smaller ones and then can take an outage to add CPUs and RAM if we need to. The maintenance windows for a server are negotiated between the application/service admins and the system administrators when a machine is put into production, we track it in our CMDB, and any member of the whole team supporting the service can take the maintenance window, as long as they follow some rules about notifications for the change (timeframes, etc.).</p>
<p><strong>2. Use of load-balancing technologies.</strong></p>
<p>We use application load balancers (layer 4 of the OSI model) to decouple services from individual servers. Not only does this allow us to take a host down without affecting a service, but it also lets us spread the load out more among the physical hosts we have in our virtual infrastructure. In a lot of cases having more, smaller VMs results in better workload scheduling by ESX and DRS, especially on smaller ESX hosts.</p>
<p>Of course, this also plays nicely into the other points, because it&#8217;s very liberating to be able to do what we call &#8220;rolling maintenance&#8221; on a service, just taking one machine down at a time so that customers are not impacted. It also means that system administrator quality of life goes up, for now we can do maintenance tasks during the day instead of on weekends and off-hours. Doing maintenance during business hours has a couple of benefits. First, it means that the maintenance will actually get done. If you try to use someone&#8217;s personal time to do work they tend to opt out of that work. Servers go unpatched, tuning doesn&#8217;t happen, lots of things that should get done don&#8217;t because people will choose their personal time over work. Second, it means that if something goes wrong there are others around to help out. Doing work at 5 AM on a Sunday is fun, but if things go sideways you have to wake someone up or try fixing it yourself. Doing work during the day means you have the rest of the team around to lend a hand.</p>
<p>Third, it gives you a way to make incremental changes and then watch the effects. This has been particularly awesome for performance tuning of applications and our virtual environments themselves. Testing tuning changes is often hard, because test suites and test load generators are synthetic and often don&#8217;t compare to real load. But because the load is spread out we can make a change to one VM, or one ESX host servicing one VM, and keep an eye on it. I&#8217;m not advocating being a complete cowboy &#8212; you still have to do testing &#8212; but the risks to your production environment are a lot lower if you can catch problems on one VM first.</p>
<p>There are usually some other benefits to load balancers, too, that make them virtualization-friendly. Many will offload SSL processing, so your VMs have less work to do. Others have features, like iRules in F5&#8242;s products, that let you rewrite network traffic on the fly, which has some really neat implications for security, monitoring, and service delivery. And if you don&#8217;t want to buy a piece of hardware you can often get a virtual appliance from these vendors, though the physical appliances are usually a lot faster.</p>
<p><strong>3. Commitment to operating system and application patching.</strong></p>
<p>It is a fundamental belief of mine that one of the best ways to stay secure is to keep up on your patching. My organization agrees, and by using load balancers and defining maintenance windows we&#8217;ve made it easy for ourselves to keep our hosts up to date with regular patching cycles. Because we can take servers down without taking services down, and because sysadmins know exactly when a server can come down, we can schedule maintenance cycles easily, whether it&#8217;s six months out or two weeks. We can also respond very rapidly to emergency situations, like recent remote execution vulnerabilities in Microsoft Windows, by rolling patches out to development &amp; test hosts, then QA &amp; production, over the course of just two days if needed.</p>
<p>Keeping up to date with patches not only keeps you secure, it also lets you take advantage of new features that are added to operating systems. For example, Red Hat keeps adding new virtualization-friendly features, like kernel interrupt clock dividers. Being a kernel parameter you can&#8217;t just change it on the fly. And if you have to reboot, but can&#8217;t get a time to do it, you won&#8217;t do it. For us, we just rolled the change into one of our patching cycles and reduced the load on our infrastructure dramatically. Meaning more VMs per physical host, and a quantifiable amount of savings from just a small change on each machine.</p>
<p>Furthermore, our commitment to patching also extends to the virtual infrastructure itself, and we have a rule that we will not implement anything that breaks vMotion or Storage vMotion. Why? Because then it becomes very difficult to cope with ESX updates, or hardware failures, or any situation where vMotion could be used to prevent an outage. Sure, this means that we still need physical hardware for some applications, but it&#8217;s still just a fraction of the hardware we were buying years ago. This also makes virtual infrastructure easy to upgrade when the time comes, for new versions of vSphere, new storage arrays, and new physical hosts. Instead of planning outages on hundreds of VMs we just vMotion them, and nobody is the wiser.</p>
<p><em>Disclosure: F5 is a sponsor of Gestalt IT Tech Field Day, of which I have been a participant. I am not a customer of F5 at this time, though.</em></p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2762&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/04/07/how-much-capacity-should-i-have-for-vmware/" rel="bookmark">How Much Capacity Should I Have For VMware?</a><!-- (33.266)--></li>
		<li><a href="http://lonesysadmin.net/2008/08/13/why-this-vmware-time-bomb-issue-is-a-big-deal/" rel="bookmark">Why This VMware Time Bomb Issue is a Big Deal</a><!-- (32.889)--></li>
		<li><a href="http://lonesysadmin.net/2008/06/17/leopard-on-esx-would-be-nice/" rel="bookmark">Leopard on ESX Would Be Nice</a><!-- (30.8466)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/08/06/three-organizational-decisions-that-help-me-virtualize/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Happy System Administrator Appreciation Day!</title>
		<link>http://lonesysadmin.net/2010/07/30/happy-system-administrator-appreciation-day/</link>
		<comments>http://lonesysadmin.net/2010/07/30/happy-system-administrator-appreciation-day/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 17:01:06 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[General Rambling]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2757</guid>
		<description><![CDATA[The Wisconsin DMV sent me my gift a day early: And it was a present &#8212; I needed replacement plates but hadn&#8217;t ordered them yet. I&#8217;m glad I didn&#8217;t! I often joke that I haven&#8217;t come up with an original solution to anything in years, thanks to all the other sysadmins out there who share [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (70.2905)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (47.6057)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (46.5517)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>The Wisconsin DMV sent me my gift a day early:</p>
<p><img class="alignnone size-full wp-image-2758" title="UN1X" src="http://lonesysadmin.net/wp-content/uploads/2010/07/photo.png" alt="" width="450" height="232" /></p>
<p>And it was a present &#8212; I needed replacement plates but hadn&#8217;t ordered them yet. I&#8217;m glad I didn&#8217;t!</p>
<p>I often joke that I haven&#8217;t come up with an original solution to anything in years, thanks to all the other sysadmins out there who share their solutions, knowledge, and time in order to make the world better. Thank you all for everything you do!</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2757&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (70.2905)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (47.6057)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (46.5517)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/07/30/happy-system-administrator-appreciation-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rate-Limiting Steps</title>
		<link>http://lonesysadmin.net/2010/07/26/rate-limiting-steps/</link>
		<comments>http://lonesysadmin.net/2010/07/26/rate-limiting-steps/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 18:02:14 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[People Stuff]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2752</guid>
		<description><![CDATA[In the last month I&#8217;ve added quite a few blogs to my reading list. One new one is &#8220;Movin&#8217; Meat,&#8221; written by an ER doctor out of the Pacific Northwest. Besides just being interesting, some of his blog posts support my theory that IT folks can often learn things from people in other fields. The [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/03/08/my-designs-start-as-80-unjustified-raw-vision/" rel="bookmark">My Designs Start As 80% Unjustified Raw Vision</a><!-- (23.8924)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/28/misspellings-are-bad/" rel="bookmark">Misspellings Are Bad</a><!-- (19.4813)--></li>
		<li><a href="http://lonesysadmin.net/2007/04/19/why-no-html-in-microsoft-project-2007/" rel="bookmark">Why No HTML in Microsoft Project 2007?</a><!-- (17.5958)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>In the last month I&#8217;ve added quite a few blogs to my reading list. One new one is &#8220;Movin&#8217; Meat,&#8221; written by an ER doctor out of the Pacific Northwest. Besides just being interesting, some of his blog posts support my theory that IT folks can often learn things from people in other fields. The post from June 25, 2010, part four of his &#8220;Advice for Interns,&#8221; is one of these cases. When you read it (link is at the end because I want to get to my actual point before you leave to read it), I think substituting &#8220;customer/system&#8221; for &#8220;patient&#8221; in his list works nicely.</p>
<p>My real point is this: one thing in his list really stood out for me. It&#8217;s something that seems really obvious when it&#8217;s said, but also done wrong a lot:</p>
<blockquote><p>&#8220;Determine the rate-limiting step and make it priority #1 in the work-up&#8221;</p></blockquote>
<p>Figuring out what the slowest step in a project is going to be and getting to work on it right away is often key to getting a project done in a timely fashion. Especially if a large chunk of that time will be waiting for something. When you know it&#8217;s going to take six weeks for a request to make it through your purchasing department you should start that right away, especially since all you&#8217;ll be doing is waiting.</p>
<p>As kids we were told by our teachers to read all the instructions first, then start working on whatever we were doing. Determining the rate-limiting steps is the same sort of thing. By taking a few minutes at the beginning to look at the whole project first, rather than just starting on step #1 and going one by one until you&#8217;re done, you can often optimize things so that the longest parts of the project are done in parallel with the rest.</p>
<p>Links:</p>
<p>- Movin&#8217; Meat: <a href="http://allbleedingstops.blogspot.com/2010/06/friday-flashback-advice-for-interns_25.html">Friday Flashback &#8211; Advice for Interns Part Four</a></p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2752&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/03/08/my-designs-start-as-80-unjustified-raw-vision/" rel="bookmark">My Designs Start As 80% Unjustified Raw Vision</a><!-- (23.8924)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/28/misspellings-are-bad/" rel="bookmark">Misspellings Are Bad</a><!-- (19.4813)--></li>
		<li><a href="http://lonesysadmin.net/2007/04/19/why-no-html-in-microsoft-project-2007/" rel="bookmark">Why No HTML in Microsoft Project 2007?</a><!-- (17.5958)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/07/26/rate-limiting-steps/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Why No-Reply Email Is A Bad Idea</title>
		<link>http://lonesysadmin.net/2010/07/08/why-no-reply-email-is-a-bad-idea/</link>
		<comments>http://lonesysadmin.net/2010/07/08/why-no-reply-email-is-a-bad-idea/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 21:17:32 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Outright Rant]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2712</guid>
		<description><![CDATA[I absolutely hate no-reply email. I understand why it exists (autoresponders and bounces), but to send an email with no way to respond at all using the same communications medium is ridiculous. A good example of this is the customer satisfaction survey Red Hat just sent me. It is from a no-reply email address and [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/06/08/best-reply-to-an-email-of-mine/" rel="bookmark">Best Reply To An Email Of Mine</a><!-- (37.9139)--></li>
		<li><a href="http://lonesysadmin.net/2006/03/22/i-hate-programmatic-email/" rel="bookmark">I Hate Programmatic Email</a><!-- (34.8589)--></li>
		<li><a href="http://lonesysadmin.net/2010/03/24/never-send-error-email-in-a-loop/" rel="bookmark">Never Send Error Email in a Loop</a><!-- (34.3761)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>I absolutely hate no-reply email. I understand why it exists (autoresponders and bounces), but to send an email with no way to respond at all using the same communications medium is ridiculous.</p>
<p>A good example of this is the customer satisfaction survey Red Hat just sent me. It is from a no-reply email address and there is no other email address listed. There is just some text and a URL, and clicking on the URL gets me:</p>
<p><a href="http://lonesysadmin.net/wp-content/uploads/2010/07/Red-Hat.png"><img class="alignnone size-full wp-image-2713" title="rhapps.redhat.com not found" src="http://lonesysadmin.net/wp-content/uploads/2010/07/Red-Hat.png" alt="rhapps.redhat.com not found" width="507" height="241" /></a></p>
<pre>$ dig rhapps.redhat.com ns1.redhat.com
[...snip...]
;; QUESTION SECTION:
;rhapps.redhat.com.             IN      A</pre>
<p>A records are overrated.</p>
<p>I generally am a nice guy and let vendors know something is messed up, but there are limits, especially when I&#8217;m already on the fence about a negative experience. I&#8217;m not going to open a support case with them, because it&#8217;ll never get escalated correctly[0]. And there&#8217;s no email address to send a quick note to. So it goes unfixed, Red Hat gets added to my mental list of vendors that don&#8217;t get it, and I blog about it, which may be worse than a negative survey response.</p>
<p>And for heaven&#8217;s sake, if you send out a customer satisfaction survey  make sure it works.[1]</p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</p>
<p>[0] Here&#8217;s a test for your organization: can a customer open a support problem against your web site? Will it go to the right place, i.e. the webmasters or someone intelligent who can get things fixed? If not, why not?</p>
<p>[1] This also may mean you should add an external survey service to what you monitor.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2712&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/06/08/best-reply-to-an-email-of-mine/" rel="bookmark">Best Reply To An Email Of Mine</a><!-- (37.9139)--></li>
		<li><a href="http://lonesysadmin.net/2006/03/22/i-hate-programmatic-email/" rel="bookmark">I Hate Programmatic Email</a><!-- (34.8589)--></li>
		<li><a href="http://lonesysadmin.net/2010/03/24/never-send-error-email-in-a-loop/" rel="bookmark">Never Send Error Email in a Loop</a><!-- (34.3761)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/07/08/why-no-reply-email-is-a-bad-idea/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>More Equipment Means More To Go Wrong</title>
		<link>http://lonesysadmin.net/2010/06/30/more-equipment-means-more-to-go-wrong/</link>
		<comments>http://lonesysadmin.net/2010/06/30/more-equipment-means-more-to-go-wrong/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 17:04:47 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2682</guid>
		<description><![CDATA[&#8220;Everything should be made as simple as possible, but not simpler.&#8221; &#8211; Albert Einstein Over the last couple years my organization has been building out an alternate site, for disaster recovery and business continuity purposes. I&#8217;ve noticed a disturbing trend, both among my own coworkers and others who are starting to think about DR &#38; [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2008/07/31/not-virtualizing-everything-or-know-your-dependencies/" rel="bookmark">Not Virtualizing Everything, or Know Your Dependencies</a><!-- (36.7811)--></li>
		<li><a href="http://lonesysadmin.net/2008/09/10/complexity-vs-availability/" rel="bookmark">Complexity vs. Availability</a><!-- (36.6995)--></li>
		<li><a href="http://lonesysadmin.net/2009/04/09/should-i-convert-my-old-servers-to-esx/" rel="bookmark">Should I Convert My Old Servers to ESX?</a><!-- (32.2973)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p><em>&#8220;Everything should be made as simple as possible, but not simpler.&#8221; &#8211; Albert Einstein</em></p>
<p>Over the last couple years my organization has been building out an alternate site, for disaster recovery and business continuity purposes. I&#8217;ve noticed a disturbing trend, both among my own coworkers and others who are starting to think about DR &amp; BC: that by having multiple data centers, multiple servers, or using multiple cloud vendors they&#8217;ll reduce the number of problems they&#8217;ll have.</p>
<p>From a system administrator&#8217;s point of view that idea is absolutely false.</p>
<p>Every piece of equipment you have can fail, and the more equipment you possess the more likely a failure will be. The more servers, switches, chillers, PDUs, and power grids you have the more likely it will be that one dies. Heck, that&#8217;s even true of components within the servers. As I add more servers to my own environment, each with mirrored disks and tens of DIMMs, I spend more time replacing failed drives, failed DIMMs, updating firmware, etc.</p>
<p>Alternate data centers also add other problems than just mechanical failure. Your second data center is probably attached to a different power grid, so now you have two power utilities to worry about, and are subject to twice as many outages. You also have to maintain the equipment in that other data center, change the air filters on the air handlers, etc. And it isn&#8217;t twice as much work as having one data center, it&#8217;s three or four times when you count the overhead of traveling to a remote site, not having tools available, etc.</p>
<p>Having alternate sites, and multiple servers, can certainly improve application reliability by keeping services available to customers in case something happens. It&#8217;s a good bet that a power problem in San Jose won&#8217;t affect your site in Omaha, for example. But it absolutely increases your odds of having equipment failure, and while your customers might be happy your operations folks won&#8217;t be.</p>
<p>More equipment means more that can go wrong. So what do you do? Keep it simple. Have fewer, bigger machines. Have machines with fewer moving parts. Use a cloud provider with multiple sites. Use VMware DPM and keep your alternate site machines in standby most of the time, and the HVAC off. There are lots of ideas, but remember that the more you have, the more that will fail, and those failures mean time spent not moving forward.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2682&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2008/07/31/not-virtualizing-everything-or-know-your-dependencies/" rel="bookmark">Not Virtualizing Everything, or Know Your Dependencies</a><!-- (36.7811)--></li>
		<li><a href="http://lonesysadmin.net/2008/09/10/complexity-vs-availability/" rel="bookmark">Complexity vs. Availability</a><!-- (36.6995)--></li>
		<li><a href="http://lonesysadmin.net/2009/04/09/should-i-convert-my-old-servers-to-esx/" rel="bookmark">Should I Convert My Old Servers to ESX?</a><!-- (32.2973)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/06/30/more-equipment-means-more-to-go-wrong/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Levels of Indirection</title>
		<link>http://lonesysadmin.net/2010/06/29/levels-of-indirection/</link>
		<comments>http://lonesysadmin.net/2010/06/29/levels-of-indirection/#comments</comments>
		<pubDate>Tue, 29 Jun 2010 20:08:21 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Quotes]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2676</guid>
		<description><![CDATA[&#8220;All problems in computer science can be solved by another level of indirection&#8230; Except for the problem of too many layers of indirection.&#8221; - David Wheeler, though often attributed to Butler Lampson, who has some great quotes, too: &#8220;When in doubt, use brute force.&#8221; &#8220;In handling resources, strive to avoid disaster rather than to attain [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (69.4517)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (47.8788)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (46.8012)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>&#8220;All problems in computer science can be solved by another level of indirection&#8230;<br />
Except for the problem of too many layers of indirection.&#8221;</p>
<p>- <a href="http://en.wikipedia.org/wiki/David_Wheeler_%28computer_scientist%29">David Wheeler</a>, though often attributed to <a href="http://en.wikipedia.org/wiki/Butler_Lampson">Butler Lampson</a>, who has some great quotes, too:</p>
<p>&#8220;When in doubt, use brute force.&#8221;<br />
&#8220;In handling resources, strive to avoid  disaster rather than to attain an optimum.&#8221;</p>
<p>Lots of good stuff if you read his &#8220;<a href="http://research.microsoft.com/en-us/um/people/blampson/33-hints/webpage.html">Hints for Computer System Design</a>.&#8221;</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2676&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (69.4517)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (47.8788)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (46.8012)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/06/29/levels-of-indirection/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Midnight is Always Tomorrow</title>
		<link>http://lonesysadmin.net/2010/06/14/midnight-is-always-tomorrow/</link>
		<comments>http://lonesysadmin.net/2010/06/14/midnight-is-always-tomorrow/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 17:17:22 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Outright Rant]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2666</guid>
		<description><![CDATA[&#8220;So, are you ready for the big power outage on Sunday?&#8221; a colleague asks on Thursday. &#8220;You mean Saturday.&#8221; &#8220;No&#8230; Sunday morning.&#8221; &#8220;Um, I was told two months ago, and countless times between, that the outage is on Saturday, midnight to 8 AM, and they were starting to shut things down at 10 PM.&#8221; &#8220;It&#8217;s [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/10/03/cihost-sucks-and-i-need-another-host/" rel="bookmark">CIHost Sucks and I Need Another Host</a><!-- (26.4591)--></li>
		<li><a href="http://lonesysadmin.net/2010/08/29/vmworld-2010-saturday/" rel="bookmark">VMworld 2010: Saturday</a><!-- (22.5834)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/14/maintenance-windows/" rel="bookmark">What? Maintenance Windows?</a><!-- (20.0504)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>&#8220;So, are you ready for the big power outage on Sunday?&#8221; a colleague asks on Thursday.</p>
<p>&#8220;You mean Saturday.&#8221;</p>
<p>&#8220;No&#8230; Sunday morning.&#8221;</p>
<p>&#8220;Um, I was told two months ago, and countless times between, that the outage is on Saturday, midnight to 8 AM, and they were starting to shut things down at 10 PM.&#8221;</p>
<p>&#8220;It&#8217;s Sunday, midnight to 8 AM. They&#8217;re going to start shutting things down on Saturday at 10 PM.&#8221;</p>
<p>&#8220;Did they move the outage?&#8221;</p>
<p>&#8220;No, I bet they were just telling you when things were going to start. On Saturday.&#8221;</p>
<p><strong>Midnight is 00:00, meaning the start of a new day. </strong><strong>Always.</strong></p>
<p>If you&#8217;re in doubt, use 00:01. Assume everybody is clueless about time, because they are. For example, a lot of people think in terms of when they go to sleep, not what actual time it is, so if they&#8217;re still up at 0200 on Sunday they consider it to be Saturday. While that&#8217;s wrong, and makes visions of their painful, torturous death flash in your mind, it&#8217;s a fact of life. Deal with it.</p>
<p>Be precise. Use 24-hour time, because there is no AM/PM question. 24 hour time runs between 0000 and 2359 on any given day. <strong>There is no 2400</strong>[0].</p>
<p>Last, all times should be accompanied by days, and vice-versa. It&#8217;s like units in science classes. You didn&#8217;t just write &#8220;1.67,&#8221; you wrote &#8220;1.67 meters.&#8221; It isn&#8217;t &#8220;0800,&#8221; it is always &#8220;0800 on 4/18/2010.&#8221; <strong>Times are useless without dates.</strong> And if your team or customers are not all in the same time zone, and they rarely are[1], you need that information, too.</p>
<p><em>&#8220;The system shutdowns will commence at 2200 on 4/17/2010, the power will be disconnected at 0000 on 4/18/2010, and power-ups will occur again at 0800 on 4/18/2010.</em><em> All times are in CDT (-0500).&#8221;</em></p>
<p><em>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</em></p>
<p>[0] Yes, I am aware there are sometimes leap seconds, which get added to the end of a day, thus causing a 23:59:60. 99.99%+ of all outage planning does not need to take this into account.</p>
<p>[1] And even if they are, it doesn&#8217;t hurt to add that information.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2666&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/10/03/cihost-sucks-and-i-need-another-host/" rel="bookmark">CIHost Sucks and I Need Another Host</a><!-- (26.4591)--></li>
		<li><a href="http://lonesysadmin.net/2010/08/29/vmworld-2010-saturday/" rel="bookmark">VMworld 2010: Saturday</a><!-- (22.5834)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/14/maintenance-windows/" rel="bookmark">What? Maintenance Windows?</a><!-- (20.0504)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/06/14/midnight-is-always-tomorrow/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>What are P-states and how do I use them in vSphere?</title>
		<link>http://lonesysadmin.net/2010/05/21/what-are-p-states-and-how-do-i-use-them-in-vsphere/</link>
		<comments>http://lonesysadmin.net/2010/05/21/what-are-p-states-and-how-do-i-use-them-in-vsphere/#comments</comments>
		<pubDate>Fri, 21 May 2010 18:08:34 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2588</guid>
		<description><![CDATA[VMware vSphere 4 added the ability to take advantage of Intel SpeedStep and AMD PowerNow! CPU power management features. These features are commonly known as &#8220;Dynamic Voltage and Frequency Scaling&#8221; or DVFS, and let an OS cooperate with the CPU to reduce power consumption by reducing the frequency of the CPU and the voltage at [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/11/12/intel-releases-the-xeon-5200-5400-processors/" rel="bookmark">Intel Releases the Xeon 5200 &#038; 5400 Processors</a><!-- (41.0864)--></li>
		<li><a href="http://lonesysadmin.net/2006/05/12/vmware-l2-cache/" rel="bookmark">VMware L2 cache</a><!-- (35.0963)--></li>
		<li><a href="http://lonesysadmin.net/2009/03/31/intel-xeon-5500-release/" rel="bookmark">Intel Xeon 5500 Release</a><!-- (34.2465)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>VMware vSphere 4 added the ability to take advantage of Intel SpeedStep and AMD PowerNow! CPU power management features. These features are commonly known as &#8220;Dynamic Voltage and Frequency Scaling&#8221; or DVFS, and let an OS cooperate with the CPU to reduce power consumption by reducing the frequency of the CPU and the voltage at which it is operating. It reduces these things in preset tiers, and these tiers are known as P-states. On Intel CPUs they are trademarked as &#8220;SpeedStep&#8221; and on AMD they are either &#8220;Cool&#8217;n'Quiet&#8221; or &#8220;PowerNow!&#8221;</p>
<p>The Wikipedia article on <a href="http://en.wikipedia.org/wiki/SpeedStep">Intel SpeedStep</a> points out that &#8220;power consumed by a CPU with a capacitance of C, running at voltage V, and frequency f is approximately <em>P</em> = <em>C</em><em>V</em><sup>2</sup><em>f</em>.&#8221;  This means if you can reduce the voltage to the CPU the power needs drop in a non-linear fashion. Furthermore, many electronic components run more efficiently at lower temperatures, and since consuming less power means less heat generated you end up seeing efficiency gains within the host as well as reduced load on data center cooling. This results in an overall reduced power bill, and potential savings in related systems like a UPS, generators, etc.</p>
<p>Frequency and voltage in a CPU are correlated. So are instructions per second and frequency. Basically, if you want your CPU to get more work done per second you need to increase the frequency it runs at, and to do that you need to increase the voltage. So why would you want to turn the CPU&#8217;s performance down in the first place? The thing is, CPUs are much faster than everything else in a computer system. If the CPU needs data for an operation it&#8217;ll look in cache. L1 cache operates at the CPU speed &#8212; fast but small. L2 cache operates at a fraction of the CPU speed, but still many times faster than RAM[1]. The problem is when the CPU needs data that isn&#8217;t found in cache and has to go to RAM or disk. Going to RAM means it&#8217;ll wait for thousands of clock cycles before the data is returned, because RAM is much slower than the CPU. Going to disk or network means waiting for millions of clock cycles, which is an eternity to a CPU. So while the system may be busy, the CPU might actually be idle, and that&#8217;s a great time to stop using power and generating heat.</p>
<p>When one process is doing I/O like that it&#8217;s also a good time for the hypervisor in vSphere (or scheduler in a regular OS) to run something else. That &#8220;something else&#8221; might not need the full performance of the CPU, either, and the frequency &amp; voltage of the CPU can be decreased to save power in that case, too.</p>
<p>Given that all this trouble has been taken to add this feature to hardware and software, how do you turn it on?</p>
<p>1. Make sure your CPUs have this feature. According to VMware vCenter, under Configuration-&gt;Processors, my sample Dell PowerEdge R610 has Intel E5530 CPUs. I can check that by looking at Intel&#8217;s product web site, <a href="http://ark.intel.com/">ark.intel.com</a>, under &#8220;Xeon&#8221; processors.</p>
<p>2. If, in vCenter, under Configuration-&gt;Processors it has something like &#8220;Enhanced Intel SpeedStep&#8221; listed by &#8220;Power Management Technology&#8221; then you can proceed to step 3. If it says &#8220;Not Available&#8221; or something else you may need to set your BIOS to allow operating system control of the power management. On my Dell PowerEdge R610 the option is under Power Management. Set it to &#8220;OS Control&#8221; as:</p>
<p><a href="http://lonesysadmin.net/wp-content/uploads/2010/05/1937f699f8a4e1164e98fe21458303c0.png"><img class="alignnone size-full wp-image-2595" title="Dell R610 Bios 1.3.6 - Power Management" src="http://lonesysadmin.net/wp-content/uploads/2010/05/1937f699f8a4e1164e98fe21458303c0.png" alt="Dell R610 Bios 1.3.6 - Power Management" width="504" height="282" /></a></p>
<p>On some older models, like the PowerEdge R900, it&#8217;s in the CPU options and called &#8220;Demand-Based Power Management.&#8221;</p>
<p>3. Go back in vCenter. By now the Power Management Technology should be populated with something other than &#8220;Not Available&#8221; (if that isn&#8217;t the case then check with your hardware vendor). If that&#8217;s set, go to Configuration-&gt;Advanced Settings, then Power, and change Power.CpuPolicy to &#8220;dynamic.&#8221;</p>
<p><a href="http://lonesysadmin.net/wp-content/uploads/2010/05/vSphere-Advanced-Settings-Power.png"><img class="alignnone size-full wp-image-2596" title="vSphere Advanced Settings - Power" src="http://lonesysadmin.net/wp-content/uploads/2010/05/vSphere-Advanced-Settings-Power.png" alt="vSphere Advanced Settings - Power" width="522" height="434" /></a></p>
<p>4. Say OK and you&#8217;re set.</p>
<p>I&#8217;ve added this to my checklist for bringing a new ESX host online now, and now that I&#8217;ve got it enabled I&#8217;m watching the power consumption a lot more closely. Can I tell a difference? Hard to say right now, as I don&#8217;t have enough new data for my small clusters. It still doesn&#8217;t replace Dynamic Power Management (DPM), because if you genuinely don&#8217;t need the capacity of a host shutting it completely off makes the most sense. But in the effort to be greener, every little bit helps, and it&#8217;s easy to enable.</p>
<p>As always, if I&#8217;ve made a mistake or you&#8217;d like to add relevant information just make a comment below. I read all my comments!</p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</p>
<p>[1] This is why larger L1 &amp; L2 caches are better, why prefetchers exist (to try prepopulating the caches with data the CPU might need), why architectures like Intel&#8217;s Nehalem add L3 caches that are shared among the cores, and why hypervisors try to schedule the same process on the same CPUs when they can (CPU affinity increases the chance that useful data is still in the caches). It&#8217;s all a big effort to keep the CPUs from waiting.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2588&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2007/11/12/intel-releases-the-xeon-5200-5400-processors/" rel="bookmark">Intel Releases the Xeon 5200 &#038; 5400 Processors</a><!-- (41.0864)--></li>
		<li><a href="http://lonesysadmin.net/2006/05/12/vmware-l2-cache/" rel="bookmark">VMware L2 cache</a><!-- (35.0963)--></li>
		<li><a href="http://lonesysadmin.net/2009/03/31/intel-xeon-5500-release/" rel="bookmark">Intel Xeon 5500 Release</a><!-- (34.2465)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/05/21/what-are-p-states-and-how-do-i-use-them-in-vsphere/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>If You Don&#8217;t Like Change&#8230;</title>
		<link>http://lonesysadmin.net/2010/05/20/if-you-dont-like-change/</link>
		<comments>http://lonesysadmin.net/2010/05/20/if-you-dont-like-change/#comments</comments>
		<pubDate>Thu, 20 May 2010 19:32:40 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Quotes]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2589</guid>
		<description><![CDATA[&#8220;If you don&#8217;t like change, you&#8217;re going to like irrelevance even less.&#8221; &#8211; General Eric Shinseki, former United States Army Chief of Staff. This post written by Bob Plankers for The Lone Sysadmin. Unless otherwise noted it is &#169; 2010 Bob Plankers and licensed under the Creative Commons BY-NC-SA 3.0 license. Related Posts Terms of [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (73.5011)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (44.7083)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (43.276)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>&#8220;If you don&#8217;t like change, you&#8217;re going to like irrelevance even less.&#8221; &#8211; General Eric Shinseki, former United States Army Chief of Staff.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2589&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/terms-of-use/" rel="bookmark">Terms of Use</a><!-- (73.5011)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/01/license-changes/" rel="bookmark">License Changes</a><!-- (44.7083)--></li>
		<li><a href="http://lonesysadmin.net/2008/02/16/creative-commons/" rel="bookmark">Creative Commons</a><!-- (43.276)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/05/20/if-you-dont-like-change/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Power Consumption of a Dell PowerEdge R610</title>
		<link>http://lonesysadmin.net/2010/05/13/power-consumption-of-a-dell-poweredge-r10/</link>
		<comments>http://lonesysadmin.net/2010/05/13/power-consumption-of-a-dell-poweredge-r10/#comments</comments>
		<pubDate>Thu, 13 May 2010 16:44:14 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2570</guid>
		<description><![CDATA[For planning purposes I just did some power draw testing of a Dell PowerEdge R610. Dual Intel X5550 CPUs, 24 GB of RAM, four SSD disks attached to the PERC6/i, and dual 717 Watt power supplies. My testing methodology was to measure the draw using a Fluke 322 clamp meter, both at idle and running [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2010/03/30/dell-poweredge-r610-perc6i-disk-comparison/" rel="bookmark">Dell PowerEdge R610 &#038; PERC/6i Disk Comparison</a><!-- (55.0423)--></li>
		<li><a href="http://lonesysadmin.net/2005/09/23/configuring-and-securing-ipmi-on-dell-poweredge-x8xx-hardware/" rel="bookmark">Configuring and Securing IPMI on Dell PowerEdge x8xx Hardware</a><!-- (39.2302)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/29/nagios-sun-t2000-vs-dell-poweredge-2950/" rel="bookmark">Nagios: Sun T2000 vs. Dell PowerEdge 2950</a><!-- (38.4127)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>For planning purposes I just did some power draw testing of a Dell PowerEdge R610. Dual Intel X5550 CPUs, 24 GB of RAM, four SSD disks attached to the PERC6/i, and dual 717 Watt power supplies. My testing methodology was to measure the draw using a Fluke 322 clamp meter, both at idle and running a <a href="http://freshmeat.net/projects/stress">stress</a> test under Red Hat Enterprise Linux 5 (stress -c 32 -d 8 -i 8 -m 16). I did this with one and two power supplies active.</p>
<p>1 PS, idle: 0.65 Amps @ 202.3 Volts = 131.5 Watts</p>
<p>1 PS, loaded: 1.51 Amps @ 202.3 Volts = 305.5 Watts</p>
<p>2 PS, idle: 0.35 Amps @ 202.3 Volts = 70.8 Watts each (total of 141.6 Watts)</p>
<p>2 PS, loaded: 0.77 Amps @ 202.3 Volts = 155.8 Watts each (total of 311.6 Watts)</p>
<p>Virtualization users who didn&#8217;t see the VCritical commentary on &#8220;<a href="http://www.vcritical.com/2010/05/idle-rhev-hypervisors-save-power/r">Idle RHEL Hypervisors save power?</a>&#8221; might want to check that out, since these numbers directly support Mr. Gray&#8217;s argument. An idle server drawing 45% of the power of a loaded server is a pretty solid argument for VMware DPM.</p>
<p>Also note that the label on the power supplies indicates these are 717 watt power supplies, while the maximum draw I recorded was less than half that. While building in some overhead is a good idea, using the labels to determine draw isn&#8217;t a great idea, because you <em>will </em>overbuild your infrastructure.</p>
<p><strong>Update:</strong> As was pointed out in the comments, you can use the Dell iDRAC web interface to find out the current power consumption of the 11th generation Dells. However, it looks like that has accuracy problems, particularly at the low end (idle), but is relatively usable at the top end. If you&#8217;re serious about measuring this stuff I&#8217;d still get a meter, though.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2570&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2010/03/30/dell-poweredge-r610-perc6i-disk-comparison/" rel="bookmark">Dell PowerEdge R610 &#038; PERC/6i Disk Comparison</a><!-- (55.0423)--></li>
		<li><a href="http://lonesysadmin.net/2005/09/23/configuring-and-securing-ipmi-on-dell-poweredge-x8xx-hardware/" rel="bookmark">Configuring and Securing IPMI on Dell PowerEdge x8xx Hardware</a><!-- (39.2302)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/29/nagios-sun-t2000-vs-dell-poweredge-2950/" rel="bookmark">Nagios: Sun T2000 vs. Dell PowerEdge 2950</a><!-- (38.4127)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/05/13/power-consumption-of-a-dell-poweredge-r10/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Dell PowerEdge R610 &amp; PERC/6i Disk Comparison</title>
		<link>http://lonesysadmin.net/2010/03/30/dell-poweredge-r610-perc6i-disk-comparison/</link>
		<comments>http://lonesysadmin.net/2010/03/30/dell-poweredge-r610-perc6i-disk-comparison/#comments</comments>
		<pubDate>Tue, 30 Mar 2010 19:48:46 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2486</guid>
		<description><![CDATA[I&#8217;ve recently done some very basic disk performance testing of a Dell PowerEdge R610 with 24 GB of RAM (1333 MHz), dual Intel X5550 CPUs, a PERC/6i RAID controller, and a bunch of 146 GB 15K RPM 2.5&#8243; disks, as well as four of the Dell 50 GB enterprise SSD disks (which are Samsung drives). [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2006/11/16/dell-perc-5i-not-caching/" rel="bookmark">Dell PERC 5/i Not Caching</a><!-- (69.2463)--></li>
		<li><a href="http://lonesysadmin.net/2010/05/13/power-consumption-of-a-dell-poweredge-r10/" rel="bookmark">Power Consumption of a Dell PowerEdge R610</a><!-- (59.1183)--></li>
		<li><a href="http://lonesysadmin.net/2007/07/19/red-hat-broke-by-fixing-nic-detection-on-dell-x9xx/" rel="bookmark">Red Hat broke (by fixing) NIC detection on Dell PowerEdge 2950s</a><!-- (38.4017)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve recently done some very basic disk performance testing of a Dell PowerEdge R610 with 24 GB of RAM (1333 MHz), dual Intel X5550 CPUs, a PERC/6i RAID controller, and a bunch of 146 GB 15K RPM 2.5&#8243; disks, as well as four of the Dell 50 GB enterprise SSD disks (which are Samsung drives). I tested various combinations of RAID 0, 1, 5, 6, 10, and 50 with 1, 2, 3, 4, and 6 disks.</p>
<p>While the RAID controller configurations varied, all the configs had the element size set to 64 KB, read policy set to Adaptive Read Ahead, and write policy set to Write Back. The PERC/6i firmware was 6.2.0-013. The operating system was Red Hat Enterprise Linux 5 Update 4, 64-bit, updated to the latest patches as of 3/10/2010. The filesystems were all LVM-based ext3 filesystems, formatted with &#8220;mke2fs -j -m 0 -O dir_index.&#8221; I used the benchmarking command bonnie++, in the form &#8220;bonnie++ -r 32768&#8243; to indicate that I had 32 GB of RAM (though I had 24, this ensures that writes and reads are larger than the cache, so caching has a negligible effect on the results). I ran each test three times and averaged the results.</p>
<p>There is a big surprise in this data, which I will have to revisit: the sequential block read performance for 6 disk RAID5. Is that an anomaly in my configuration, or is it really that fast? I will need to revisit that when I set the test environment up again. I would expect results more consistent with RAID6 read performance, but perhaps RAID6 isn&#8217;t as mature as the RAID5 algorithms.</p>
<p>This isn&#8217;t as complete as it could be, and other disk benchmarks, like  iozone, do a better job of characterizing disk performance with random  workloads, where the SSD would likely do much better. There are also  newer disk controllers out there, namely the Dell H700 with 6 Gbps SAS links, that may improve  on these scores. But it&#8217;s what I needed for something I&#8217;m doing, and if it helps someone else I&#8217;m glad I posted it.</p>
<p>Click on the graphic for a larger version that&#8217;s more readable.</p>
<p><a href="http://lonesysadmin.net/wp-content/uploads/2010/03/RAID-Perf_27920_image001.gif"><img class="alignnone size-full wp-image-2495" title="RAID Performance on R610 &amp; PERC/6i" src="http://lonesysadmin.net/wp-content/uploads/2010/03/RAID-Perf_27920_image001.gif" alt="" width="552" height="403" /></a></p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2486&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2006/11/16/dell-perc-5i-not-caching/" rel="bookmark">Dell PERC 5/i Not Caching</a><!-- (69.2463)--></li>
		<li><a href="http://lonesysadmin.net/2010/05/13/power-consumption-of-a-dell-poweredge-r10/" rel="bookmark">Power Consumption of a Dell PowerEdge R610</a><!-- (59.1183)--></li>
		<li><a href="http://lonesysadmin.net/2007/07/19/red-hat-broke-by-fixing-nic-detection-on-dell-x9xx/" rel="bookmark">Red Hat broke (by fixing) NIC detection on Dell PowerEdge 2950s</a><!-- (38.4017)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/03/30/dell-poweredge-r610-perc6i-disk-comparison/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Never Send Error Email in a Loop</title>
		<link>http://lonesysadmin.net/2010/03/24/never-send-error-email-in-a-loop/</link>
		<comments>http://lonesysadmin.net/2010/03/24/never-send-error-email-in-a-loop/#comments</comments>
		<pubDate>Wed, 24 Mar 2010 15:19:31 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2458</guid>
		<description><![CDATA[Some of my favorite system outages are denial-of-service attacks brought on by coders who code as if nothing will ever go wrong. For instance, take the following section of pseudocode: foreach $email (@giant_list_of_customer_email_addrs) { @customer_info = get_database_info_for_customer($email); if (!defined(@customer_info)) { send_error_email_to_admins($email); } else { send_customer_email(@customer_info); undef(@customer_info); } } When get_database_info_for_customer() fails (such as when the [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2006/03/22/i-hate-programmatic-email/" rel="bookmark">I Hate Programmatic Email</a><!-- (40.4602)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/02/how-a-sysadmin-got-his-email-under-control-2/" rel="bookmark">How A Sysadmin Got His Email Under Control</a><!-- (34.2045)--></li>
		<li><a href="http://lonesysadmin.net/2009/03/06/useful-error-messages/" rel="bookmark">Useful Error Messages</a><!-- (32.6824)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>Some of my favorite system outages are denial-of-service attacks brought on by coders who code as if nothing will ever go wrong. For instance, take the following section of pseudocode:</p>
<pre>foreach $email (@giant_list_of_customer_email_addrs) {
    @customer_info = get_database_info_for_customer($email);
    if (!defined(@customer_info)) {
        send_error_email_to_admins($email);
    } else {
        send_customer_email(@customer_info);
        undef(@customer_info);
    }
}
</pre>
<p>When get_database_info_for_customer() fails (such as when the database is down for maintenance), someone will get an email for every failure. This is merely annoying when @giant_list_of_customer_email_addrs is 50 people, but when it&#8217;s 200,000 people it&#8217;s a big problem. First, you get hundreds of copies of sendmail running (or whatever the mailer function uses &#8212; with a lazy coder like this it&#8217;ll usually be something that isn&#8217;t efficient at all). Second, your local SMTP server gets overwhelmed, and its spam &amp; antivirus scanners start melting down. Third, your mail spool fills, which causes other problems, like bounce messages that exacerbate the situation. And now, instead of having your database server down for maintenance and a scheduled script that should have run but couldn&#8217;t, you also have some of your other systems down as part of an incident. Whoops.</p>
<p>Never send error email in a loop. If you need to notify someone that a script had a failure do it outside of the loop, and use a stack to keep track of the errors that were encountered to be listed in the error message. Certainly, in this case, detecting a database problem and exiting would have helped, too. However, with all the things that could go wrong with databases, customer information, and general programmatic input it&#8217;s still worthwhile to avoid the inadvertent DoS by moving error notification outside the loop. At least that way you only risk having a single, potentially giant, email. :)</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2458&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2006/03/22/i-hate-programmatic-email/" rel="bookmark">I Hate Programmatic Email</a><!-- (40.4602)--></li>
		<li><a href="http://lonesysadmin.net/2007/05/02/how-a-sysadmin-got-his-email-under-control-2/" rel="bookmark">How A Sysadmin Got His Email Under Control</a><!-- (34.2045)--></li>
		<li><a href="http://lonesysadmin.net/2009/03/06/useful-error-messages/" rel="bookmark">Useful Error Messages</a><!-- (32.6824)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/03/24/never-send-error-email-in-a-loop/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Future Capacity Planning</title>
		<link>http://lonesysadmin.net/2010/01/21/future-capacity-planning/</link>
		<comments>http://lonesysadmin.net/2010/01/21/future-capacity-planning/#comments</comments>
		<pubDate>Thu, 21 Jan 2010 22:10:17 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2392</guid>
		<description><![CDATA[My favorite question from manager types is: &#8220;How many more VMs can we run before we have to expand?&#8221; I can never answer this without someone sticking it to me later. I always do end up answering it, and my answer is always wrong because it&#8217;s based on averages and the very little I&#8217;m told [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/04/07/how-much-capacity-should-i-have-for-vmware/" rel="bookmark">How Much Capacity Should I Have For VMware?</a><!-- (27.8152)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/12/get-vmworld-into-your-budgets/" rel="bookmark">Get VMworld Into Your Budgets</a><!-- (25.9685)--></li>
		<li><a href="http://lonesysadmin.net/2009/04/15/what-kind-of-servers-should-i-buy-for-vmware/" rel="bookmark">What Kind Of Servers Should I Buy for VMware?</a><!-- (20.2187)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>My favorite question from manager types is:</p>
<p>&#8220;How many more VMs can we run before we have to expand?&#8221;</p>
<p>I can never answer this without someone sticking it to me later. I always do end up answering it, and my answer is always wrong because it&#8217;s based on averages and the very little I&#8217;m told about future projects, upcoming P2Vs, server replacements, etc. We aren&#8217;t going to get 25 more 1.28 vCPU/2.398 GB of RAM VMs, though. It&#8217;s like having 1.75 kids &#8212; it just doesn&#8217;t work that way. I could try to tell them that we have 108 GB of RAM available, but that isn&#8217;t what they want, either. They want a concrete number they can multiply by our chargeback rates and put in the budget.</p>
<p>It&#8217;s hard to explain the problem with all of this, though, and I&#8217;ve been searching for a good analogy to make people realize why I&#8217;m so cagy about an answer. My awesome financial analyst, Michelle Fritze, <em>just</em> came up with it:</p>
<p>&#8220;How many boxes fit in your office?&#8221;</p>
<p>I can&#8217;t wait to ask my CIO that.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2392&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/04/07/how-much-capacity-should-i-have-for-vmware/" rel="bookmark">How Much Capacity Should I Have For VMware?</a><!-- (27.8152)--></li>
		<li><a href="http://lonesysadmin.net/2008/12/12/get-vmworld-into-your-budgets/" rel="bookmark">Get VMworld Into Your Budgets</a><!-- (25.9685)--></li>
		<li><a href="http://lonesysadmin.net/2009/04/15/what-kind-of-servers-should-i-buy-for-vmware/" rel="bookmark">What Kind Of Servers Should I Buy for VMware?</a><!-- (20.2187)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/01/21/future-capacity-planning/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>WebEx &amp; Aero</title>
		<link>http://lonesysadmin.net/2010/01/15/webex-aero/</link>
		<comments>http://lonesysadmin.net/2010/01/15/webex-aero/#comments</comments>
		<pubDate>Fri, 15 Jan 2010 20:02:38 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[System Administration]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2386</guid>
		<description><![CDATA[WebEx and Microsoft Windows 7 don&#8217;t seem to get along 100% quite yet. If you are using WebEx on Windows 7 it&#8217;ll disable Aero during your session. However, if your session is over and you don&#8217;t get Aero back here&#8217;s how to fix it without rebooting: 1. Make sure you&#8217;ve closed/exited all WebEx components. 2. [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/03/24/not-running-vmware-capacity-planner-as-root-on-linux/" rel="bookmark">Not Running VMware Capacity Planner as root on Linux</a><!-- (25.0176)--></li>
		<li><a href="http://lonesysadmin.net/2007/02/06/use-at-now-to-daemonize-tasks/" rel="bookmark">Tip: Use &#8216;at now&#8217; to daemonize tasks</a><!-- (24.9031)--></li>
		<li><a href="http://lonesysadmin.net/2007/06/21/how-to-configure-ipmi-on-a-dell-poweredge-running-red-hat-enterprise-linux/" rel="bookmark">How to Configure IPMI on a Dell PowerEdge running Red Hat Enterprise Linux</a><!-- (22.0588)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>WebEx and Microsoft Windows 7 don&#8217;t seem to get along 100% quite yet. If you are using WebEx on Windows 7 it&#8217;ll disable Aero during your session. However, if your session is over and you don&#8217;t get Aero back here&#8217;s how to fix it without rebooting:</p>
<p>1. Make sure you&#8217;ve closed/exited all WebEx components.</p>
<p>2. Right click on Windows Menu-&gt;All Programs -&gt; Accessories -&gt; Command Prompt and choose &#8220;Run as administrator.&#8221; You will need to accept a User Access Control warning about this.</p>
<p>3. Issue the commands:</p>
<pre>net stop uxsms</pre>
<pre>net start uxsms</pre>
<p>That should fix it.</p>
<p>Alternately (and potentially easier): you could restart the &#8220;Desktop Window Manager Session Manager&#8221; service via the Services administrative tool.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2386&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2009/03/24/not-running-vmware-capacity-planner-as-root-on-linux/" rel="bookmark">Not Running VMware Capacity Planner as root on Linux</a><!-- (25.0176)--></li>
		<li><a href="http://lonesysadmin.net/2007/02/06/use-at-now-to-daemonize-tasks/" rel="bookmark">Tip: Use &#8216;at now&#8217; to daemonize tasks</a><!-- (24.9031)--></li>
		<li><a href="http://lonesysadmin.net/2007/06/21/how-to-configure-ipmi-on-a-dell-poweredge-running-red-hat-enterprise-linux/" rel="bookmark">How to Configure IPMI on a Dell PowerEdge running Red Hat Enterprise Linux</a><!-- (22.0588)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/01/15/webex-aero/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Playing Mastermind With My RAM</title>
		<link>http://lonesysadmin.net/2010/01/12/playing-mastermind-with-my-ram/</link>
		<comments>http://lonesysadmin.net/2010/01/12/playing-mastermind-with-my-ram/#comments</comments>
		<pubDate>Tue, 12 Jan 2010 19:46:02 +0000</pubDate>
		<dc:creator>Bob Plankers</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[General Rambling]]></category>
		<category><![CDATA[System Administration]]></category>

		<guid isPermaLink="false">http://lonesysadmin.net/?p=2340</guid>
		<description><![CDATA[I have a Dell PowerEdge R610 in one of my VMware vSphere clusters that has been reporting memory errors. In fact, the machine wouldn&#8217;t boot, and the front panel suggested I reseat all the RAM. Okay&#8230; 0. Reseat all the RAM. Didn&#8217;t work, as expected. 1. Pull all twelve DIMMs out, put four back in. [...]

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2008/10/05/intel-7400-memory-population/" rel="bookmark">Intel 7400 Memory Population</a><!-- (33.2213)--></li>
		<li><a href="http://lonesysadmin.net/2007/07/11/just-pull-the-drive/" rel="bookmark">Just Pull the Drive</a><!-- (27.9087)--></li>
		<li><a href="http://lonesysadmin.net/2008/01/04/new-laptop-on-its-way/" rel="bookmark">New Laptop On Its Way</a><!-- (20.8994)--></li>
	</ol>
]]></description>
			<content:encoded><![CDATA[<p>I have a Dell PowerEdge R610 in one of my VMware vSphere clusters that has been reporting memory errors. In fact, the machine wouldn&#8217;t boot, and the front panel suggested I reseat all the RAM. Okay&#8230;</p>
<p>0. Reseat all the RAM. Didn&#8217;t work, as expected.</p>
<p>1. Pull all twelve DIMMs out, put four back in. That worked, machine comes up.</p>
<p>2. Put four more DIMMs back. That worked, machine comes up.</p>
<p>3. Put last four DIMMs in. Machine doesn&#8217;t boot, same original error.</p>
<p>4. Pull last set of DIMMs out. Boot machine. Notice that BIOS is really old. Upgrade BIOS, thinking this is some stupid BIOS bug. Machine continues to boot.</p>
<p>5. Put last four DIMMs back in. New BIOS actually tells me what DIMMs are bad. Nice, except it says that A1 and A4 are bad. Two DIMMs? Yeah, not likely.</p>
<p>6. Order single replacement DIMM from Dell, decide to play<a href="http://www.mathsisfun.com/games/mastermind-game.html"> Mastermind</a> with RAM.</p>
<p>7. Replace DIMM A1. Machine switches to saying DIMMs B3 and B5 are bad. Really? DIMM banks B are on the other CPU.</p>
<p>8. Stifle disbelief, take loose DIMM from A1 and replace B3.</p>
<p>9. Machine switches to saying DIMM B5 is bad.</p>
<p>10. Take loose DIMM from B3 and replace B5. Machine likes that, has all of its RAM again, and I probably have the offending DIMM out now. Probably.</p>
<p>Lessons here: A) physical hardware sucks. B) linear troubleshooting rules. C) keep your firmware up to date.</p>
<p><hr></p><p>This post written by Bob Plankers for <a href="http://lonesysadmin.net">The Lone Sysadmin</a>. Unless otherwise noted it is &copy; 2010 Bob Plankers and licensed under the <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/">Creative Commons BY-NC-SA 3.0 license</a>.</p><img src="http://lonesysadmin.net/?ak_action=api_record_view&id=2340&type=feed" alt="" />

<h3>Related Posts</h3>
<ol>
		<li><a href="http://lonesysadmin.net/2008/10/05/intel-7400-memory-population/" rel="bookmark">Intel 7400 Memory Population</a><!-- (33.2213)--></li>
		<li><a href="http://lonesysadmin.net/2007/07/11/just-pull-the-drive/" rel="bookmark">Just Pull the Drive</a><!-- (27.9087)--></li>
		<li><a href="http://lonesysadmin.net/2008/01/04/new-laptop-on-its-way/" rel="bookmark">New Laptop On Its Way</a><!-- (20.8994)--></li>
	</ol>
]]></content:encoded>
			<wfw:commentRss>http://lonesysadmin.net/2010/01/12/playing-mastermind-with-my-ram/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
