<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>NearlyFreeSpeech.NET Blog &#187; Network Status</title>
	<atom:link href="http://blog.nearlyfreespeech.net/category/status/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.nearlyfreespeech.net</link>
	<description>A blog from the staff at NearlyFreeSpeech.NET.</description>
	<pubDate>Wed, 19 Nov 2008 21:22:44 +0000</pubDate>
	
	<language>en</language>
			<item>
		<title>Email forwarding changes and UI downtime</title>
		<link>http://blog.nearlyfreespeech.net/2008/11/19/email-forwarding-changes-and-ui-downtime/</link>
		<comments>http://blog.nearlyfreespeech.net/2008/11/19/email-forwarding-changes-and-ui-downtime/#comments</comments>
		<pubDate>Wed, 19 Nov 2008 21:04:09 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/?p=66</guid>
		<description><![CDATA[Over the next few days, we will be making major changes to our email forwarding service in order to make sure that we are providing a consistent, high-quality forwarding service that meets our members&#8217; expectations.  This is something we have wanted to do for awhile, but recent events have forced us to pursue a [...]]]></description>
			<content:encoded><![CDATA[<p>Over the next few days, we will be making major changes to our email forwarding service in order to make sure that we are providing a consistent, high-quality forwarding service that meets our members&#8217; expectations.  This is something we have wanted to do for awhile, but recent events have forced us to pursue a change strategy that might not have been our first choice.<br />
<span id="more-66"></span><br />
One of the major problems we have with our service is that we have two different email forwarding services, the differences between which are documented very poorly when they are documented at all.</p>
<p>The older one is called Legacy Forwarding.  It is a service we provide using our own servers, and adheres to our draconian <a href="https://www.nearlyfreespeech.net/about/email.php">email acceptance policy</a>.</p>
<p>The newer one is simply called Email Forwarding.  It is a service we provide in conjunction with a partnership with a third-party company.  It is supposed to work a little differently; we call it a &#8220;hybrid&#8221; forwarding system because it combines certain aspects of a mailbox with certain aspects of email forwarding.  Their servers are willing to accept virtually any email, but they then perform content-filtering checks and are supposed to divert suspect messages into a special &#8220;spam quarantine&#8221; accessible via our member interface.</p>
<p>I say &#8220;supposed to&#8221; because at some point in the recent past, without consulting with or notifying us, the third-party company unilaterally changed their system to discard all spam-suspect messages instead of placing them into the quarantine folder.  At first we thought this was isolated to a few domains, but we have since discovered that the change was global.  This means that for all of our members who have &#8220;Email Forwarding,&#8221; any messages you receive that are believed by their system to be spam, including false positives, are currently being silently discarded.  This is absolutely not acceptable to us, but our efforts to redress this with them have not been successful.</p>
<p>Ultimately, our email forwarding members are paying us for email forwarding services, not a third party company, so it is our responsibility to make sure our members get the service they are paying for.  To that end, we are taking the following steps:</p>
<ul>
<li>We will stop billing for all email forwarding domains until we feel things are working the way we intend.</li>
<li>We are deploying a new hybrid email forwarding system, described below, which will be applied to all members&#8217; email forwarding domains.</li>
<li>Because we need to make changes, we will be temporarily disabling the email forwarding portions of our UI. <strong>We are not disabling email forwarding, just the UI that controls it.</strong></li>
</ul>
<p>The new email forwarding system will apply all of our existing email acceptance policy criteria, but it will do so a little differently.  As with our legacy forwarding system, messages that meet all of our criteria will sail through the system with no disruption or delay, as will messages sent by domains with proper SPF configurations, even if they have other problems.  Also as with the legacy system, messages from blacklisted servers will be immediately rejected.  </p>
<p>For messages sent by servers with improper or no SPF and one or more other configuration problems, instead of bouncing them as the legacy forwarding system does, we will be trying a technique called greylisting, which tells the server sending the message to try again in a few hours.  If the sending server does that, the message will be accepted.  </p>
<p>We will also be adding a spam/virus checking component that will reject email containing viruses and quarantine egregious spam that somehow passes all of the above filters.  (The current email forwarding scheme quarantines both spam and viruses.)  Also new will be a feature that places post-forwarding bounce messages into the same quarantine, so if there are problems with delivering forwarded messages to you, you will be better able to detect and resolve them.</p>
<p>We feel that this system will make the best trade-offs between meeting our members&#8217; need for reliable forwarding, keeping our forwarding mail servers off of global blacklists, and making life absolutely as miserable and unrewarding as we possibly can for spammers.  It will also let us have one forwarding service for all members, and document it appropriately.</p>
<p>All of our email forwarding uses MX records in the nearlyfreespeech.net domain, so we retain full control over how they are handled.  The above system is still in final development, but we believe it can be completed and deployed without any major disruption to anyone&#8217;s forwarded email.  Due to the way Internet email works, even if we do wind up needing to take forwarding offline for a few hours in the middle of the night, messages will simply queue and redeliver when it comes back up.</p>
<p>We need to disable the member UI to work on it because we need the overall configuration to hold still long enough for us to update the UI code and move everyone over to the new system.  It is our goal to have the entire transition completed by Saturday.  Since we won&#8217;t be billing for email forwarding until this is resolved, we will be very highly motivated to make it happen as quickly and accurately as possible.  If for any reason you do need to make an urgent email forwarding change that just can&#8217;t wait, or if you want to remove it altogether, just drop us a <a href="https://members.nearlyfreespeech.net/support/request">Secure Support Request</a> and we&#8217;ll do our best to take care of you.</p>
<p>Since this is still an evolving situation, the above reflects what we want to do, and what we reasonably believe we can do.  If and when circumstances change, we will update you accordingly.  These changes will not affect the cost of email forwarding, and while they may ultimately put us in a more flexible position with respect to offering email hosting in the future, that is not a goal of this project and nothing is changing on that front at this time.</p>
<p>I apologize for the short notice and rushed character of this change.  We spent a lot of effort (probably too much) trying to come to a less dramatic solution, and we were entirely unsuccessful.  </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2008/11/19/email-forwarding-changes-and-ui-downtime/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Friday night maintenance window for MySQL</title>
		<link>http://blog.nearlyfreespeech.net/2008/10/01/friday-night-maintenance-window-for-mysql/</link>
		<comments>http://blog.nearlyfreespeech.net/2008/10/01/friday-night-maintenance-window-for-mysql/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 20:15:14 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/?p=63</guid>
		<description><![CDATA[As previously announced, we are proceeding with a maintenance window on Friday night between 10pm and 1am Arizona/Pacific time (1am and 4am Eastern, 5am and 8am UTC).
The primary purpose of this maintenance window will be to add RAM and CPU power to some of our MySQL servers.  Just under 40% of member MySQL processes [...]]]></description>
			<content:encoded><![CDATA[<p>As <a href="http://blog.nearlyfreespeech.net/2008/09/29/mysql-freebsd-7-zfs-beta-update-mysql-upgrades/">previously announced</a>, we are proceeding with a maintenance window on Friday night between 10pm and 1am Arizona/Pacific time (1am and 4am Eastern, 5am and 8am UTC).</p>
<p>The primary purpose of this maintenance window will be to add RAM and CPU power to some of our MySQL servers.  Just under 40% of member MySQL processes will be affected, and we estimate the total time affected will be about 30 - 45 minutes.<br />
<span id="more-63"></span><br />
You can tell if you are likely to be affected if your MySQL @@hostname variable is &#8220;m2&#8243; or &#8220;m3.&#8221;  However, while we don&#8217;t plan to move any MySQL processes between now and then, we do reserve the right to move them at any time for any reason if appropriate to maintain network stability and performance.</p>
<p>This upgrade will increase total RAM available to member MySQL processes by 60%, with a corresponding increase in CPU power.  After the upgrade is complete, we will be able to finish the migration to FreeBSD 7 and the combined result will be better MySQL performance for all.</p>
<p>At the same time, we will be adding RAM and CPU cores to the hosting cluster, but that should not cause downtime since we can rotate the affected servers out of production prior to the upgrade and redistribute the load; there is plenty of spare capacity at that particular time.</p>
<p>As always, we appreciate your patience as we update and upgrade our network to make your hosting better, faster, and more reliable.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2008/10/01/friday-night-maintenance-window-for-mysql/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Scheduled Maintenance: IP Renumbering In Progress</title>
		<link>http://blog.nearlyfreespeech.net/2008/09/11/scheduled-maintenance-ip-renumbering-in-progress/</link>
		<comments>http://blog.nearlyfreespeech.net/2008/09/11/scheduled-maintenance-ip-renumbering-in-progress/#comments</comments>
		<pubDate>Thu, 11 Sep 2008 11:55:46 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/?p=47</guid>
		<description><![CDATA[We are in the process of renumbering out of our current IP addresses and into a larger block of addresses.  The effect of this change will be better routing performance and more room for us to grow and expand our service.

In order to minimize disruption, we plan to bring up the new IP addresses [...]]]></description>
			<content:encoded><![CDATA[<p>We are in the process of renumbering out of our current IP addresses and into a larger block of addresses.  The effect of this change will be better routing performance and more room for us to grow and expand our service.<br />
<span id="more-47"></span><br />
In order to minimize disruption, we plan to bring up the new IP addresses in parallel with the old ones and then, once everything is confirmed working, renumber all of our public-facing services.  As part of that process, we will automatically renumber all relevant DNS records hosted on our service.  If you use third party DNS, you will need to renumber manually in the near future.  However, we will provide a transitional period for you to do so.  During that period, we will automatically detect people using the old addresses and notify the site operators accordingly.  Once the transitional period is complete, we will phase out the old addresses.</p>
<p>We are pursuing this renumbering very aggressively.  Partly this is because, although we waited a year to <strong>get</strong> these IP addresses, we have a very limited window in which to renumber out of the old ones.  Such is the nature of bureaucracy.  But we have accelerated even more because we have discovered routing issues with an upstream provider related to the IP block we are in, which we believe was responsible for the disruptions some people experienced last week.  Unfortunately, we got the IP addresses we currently have from them, so we are not in a position to tell them what to do with them.  The new IPs are a direct allocation from ARIN (the American Registry of Internet Numbers) to our network operator, and thus do not have competing interests.</p>
<p>This also means that, while we may add addresses (as conservatively as possible), there should be no reason we will ever need to renumber out of existing ones for the foreseeable future once this migration is complete.</p>
<p>Unfortunately, this change is not without downsides.  As stated above, we are attempting to minimize disruption.  However, &#8220;minimized disruption&#8221; is not the same as &#8220;no disruption.&#8221;  This change requires us to make major configuration changes to each and every server, switch, router, and firewall on our network.  When it comes to routing, even the tiniest glitch or hiccup can throw everything into disarray for several minutes or more.  (Something of that nature happened earlier this morning, for which I apologize.  I likewise acknowledge that the blog post about the scheduled maintenance is <em>supposed</em> to happen <strong>before</strong> the scheduled maintenance.  My bad.)  Since we are only human, and we are very much working without a net, we are announcing scheduled maintenance between 2am and 6am EDT (6am to 10am UTC) for the next seven days.  This does <strong>not</strong> mean that our service will be down at those times, or even a fraction of them.  It simply means we will be performing configuration changes during those times, leading to a significantly increased risk of network downtime, and we want to make you aware of this.  These times were chosen by statistical measurement of our network usage; usage during this period is much lower than at any other equivalent period.</p>
<p>As a nerdy little side note, we now have native IPv6 available at the edge of our network and, as part of this change, we will be configuring our core network paths to support it.  This doesn&#8217;t mean we&#8217;re about to offer IPv6 hosting, but it does mean we won&#8217;t have to make those changes later on. </p>
<p>And for those that want to know, NearlyFreeSpeech.NET&#8217;s new IP block is 208.94.116.0/23.  Get used to it, you&#8217;ll be seeing it in traceroutes soon, and it&#8217;ll be around for a good long while.</p>
<p>This is only one of the projects currently underway to improve the core infrastructure of our service, with an emphasis on making it faster and more reliable for everyone (you may have noticed that new feature development is at a standstill while we do that).  While we&#8217;re working on this project, we apologize for any inconvenience or downtime that results, we thank you for your patience, and we promise to do our best to make the migration as smooth and trouble-free as possible.  We have one goal, and that&#8217;s to make our web hosting service amazingly good.  This will help.</p>
<p>We will try to update this blog post as we go, so we can keep you posted on how far along we are and how it&#8217;s going.  In the mean time, to preserve the signal-to-noise ratio, please keep comments on-topic and remember that due to all that&#8217;s going on, we&#8217;re super busy right now and may be somewhat slow to approve and respond to them.</p>
<p>Update 2008-09-14: We&#8217;re making good progress.  Most of our firewall/edge network has been converted, and a lot of small subnets have been retired.  We are on track to finish IPv4 renumbering on schedule.  We have also, just as an oh-so-beta proof of concept, put up IPv6 versions of our two main sites: <a href="https://www.ipv6.nearlyfreespeech.net/">www.ipv6.nearlyfreespeech.net</a> and <a href="https://members.ipv6.nearlyfreespeech.net/">members.ipv6.nearlyfreespeech.net</a>.  (You&#8217;ll get certificate warnings if you&#8217;re one of the lucky few who can access these at all.)</p>
<p>Update 2008-09-18: We continue to make progress.  New sites started being created in the new IP range this morning, and DNS and SMTP servers have also been cut over.  We will be migrating existing sites to the new range over the next day or two.  This will not cause downtime, as both the old IPs and the new ones work for all sites during the transition.  We are now running about two days behind because of the DDOS attack experienced earlier this week; one maintenance window was lost to dealing with the attack, and a second one due to recovering from dealing with the attack.</p>
<p>Update 2008-02-22: The first phase of this project did complete successfully.  The second phase of the project will require us to detect and contact people who have hardcoded the old IP addresses.  The second phase has no operational impact and will not require maintenance windows.  We will add a new blog post or an update when the second phase is complete and we are ready to begin final turn-down of the old address ranges.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2008/09/11/scheduled-maintenance-ip-renumbering-in-progress/feed/</wfw:commentRss>
		</item>
		<item>
		<title>SSH RSA host key changed</title>
		<link>http://blog.nearlyfreespeech.net/2008/05/17/ssh-rsa-host-key-changed/</link>
		<comments>http://blog.nearlyfreespeech.net/2008/05/17/ssh-rsa-host-key-changed/#comments</comments>
		<pubDate>Sat, 17 May 2008 20:42:11 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/?p=46</guid>
		<description><![CDATA[After a review of our security practices brought on by the recent Debian ssh key security issue (which does not affect our FreeBSD-based service), we&#8217;ve decided to upgrade the strength of our RSA ssh host key from 1024 bits to the same length that we recommend you use, 4096 bits.
The correct/current RSA host key:
fc:89:a1:64:74:70:a4:82:58:c1:73:4e:72:59:63:56

This naturally [...]]]></description>
			<content:encoded><![CDATA[<p>After a review of our security practices brought on by the recent Debian ssh key security issue (which does <em>not</em> affect our FreeBSD-based service), we&#8217;ve decided to upgrade the strength of our RSA ssh host key from 1024 bits to the same length that we recommend you use, 4096 bits.</p>
<p>The correct/current RSA host key:</p>
<blockquote><p>fc:89:a1:64:74:70:a4:82:58:c1:73:4e:72:59:63:56</p></blockquote>
<p><span id="more-46"></span><br />
This naturally changes the host key fingerprint, and is likely to cause warnings for people who have previously connected.  This doesn&#8217;t indicate a security problem, but you should compare the new key against its fingerprint.</p>
<p>You can verify this on our secure site in <a href="https://members.nearlyfreespeech.net/support/faq?q=NFSNsshKeys#NFSNsshKeys">the member FAQ</a>.</p>
<p>We have not changed the DSA key, as limitations in the DSA algorithm render longer keys pointless.  We discourage but will continue to allow the use of DSA keys.</p>
<p>We apologize to anyone alarmed by the ALL CAPITAL LETTERS!! warnings that ssh applications tend to produce when a host key changes.  The new key length should guarantee that we don&#8217;t have to change it again for a good long time.</p>
<p>In other key-related news, we scanned all member ssh keys for the <a href="http://wiki.debian.org/SSLkeys">Debian weak keys</a> and notified everyone who appeared to be affected.  Since there are exploits in the wild, we will shortly be removing any remaining weak keys, and we will not accept any new weak keys for access to member sites.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2008/05/17/ssh-rsa-host-key-changed/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Public and member UI sites scheduled downtime today</title>
		<link>http://blog.nearlyfreespeech.net/2008/04/14/public-and-member-sites-scheduled-downtime-today/</link>
		<comments>http://blog.nearlyfreespeech.net/2008/04/14/public-and-member-sites-scheduled-downtime-today/#comments</comments>
		<pubDate>Mon, 14 Apr 2008 16:57:27 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/?p=43</guid>
		<description><![CDATA[We will need to take some of our internal servers down briefly today: a couple of times for about 10 minutes each between 6pm and 10pm Arizona time (1am and 5am UTC).

We are completing the buildout of our new &#8220;ultra-redundant power&#8221; rack today and will be moving our master database server, our internal web server, [...]]]></description>
			<content:encoded><![CDATA[<p>We will need to take some of our internal servers down briefly today: a couple of times for about 10 minutes each between 6pm and 10pm Arizona time (1am and 5am UTC).<br />
<span id="more-43"></span><br />
We are completing the buildout of our new &#8220;ultra-redundant power&#8221; rack today and will be moving our master database server, our internal web server, and a couple of other critical infrastructure machines that don&#8217;t already have redundant power sources.</p>
<p>This outage will <b>not</b> affect member services, only our own sites (www, blog, and members), FTP, and ssh will be affected during these brief downtimes.</p>
<p>We apologize for the short notice and hope this is not too disruptive to anyone&#8217;s site maintenance plans.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2008/04/14/public-and-member-sites-scheduled-downtime-today/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Facility move post-mortem analysis</title>
		<link>http://blog.nearlyfreespeech.net/2007/09/27/facility-move-post-mortem-analysis/</link>
		<comments>http://blog.nearlyfreespeech.net/2007/09/27/facility-move-post-mortem-analysis/#comments</comments>
		<pubDate>Thu, 27 Sep 2007 00:52:32 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/2007/09/27/facility-move-post-mortem-analysis/</guid>
		<description><![CDATA[It has been a little over two weeks since our facility move.  I believe at some point we promised some post-mortem analysis of what went right and what went wrong, and here it is.
The outcome of the move is a success.  All of our core hardware is in one place, it is all [...]]]></description>
			<content:encoded><![CDATA[<p>It has been a little over two weeks since our facility move.  I believe at some point we promised some post-mortem analysis of what went right and what went wrong, and here it is.</p>
<p>The outcome of the move is a success.  All of our core hardware is in one place, it is all working, and we are in a better position to grow and expand than we have ever been.<br />
<span id="more-33"></span><br />
Our network monitoring has been vastly improved, with more improvements on the way, which will allow us to be a lot more proactive about keeping your sites healthy in the future, rather than waiting for you to complain when there&#8217;s a problem.  </p>
<p>Our network infrastructure has been greatly improved.  We have incredibly more bandwidth available on our internal network than ever before, so much that we are still tuning the most effective way to utilize it all to serve your sites as quickly as possible.  Our cluster technology has always had the ability to failover your site from one web server to another if there&#8217;s a problem, but we now have similar (but faster) hot failover technology for our routers, firewalls, and edge proxies.</p>
<p>Our network defenses have also received a sharp boost.  We have more and better firewall capability, better inbound packet scrubbing, better backup access to our equipment during attacks, and the ability to selectively blackhole attackers based on complex criteria before they ever reach NearlyFreeSpeech.NET.  Not only will this pay off big time in the event of future DDOS attacks, but it will manifest as better protection against everyday nonsense like large spam runs.</p>
<p>You&#8217;ll be hearing more about a concrete example of the effect of some of these improvements in a near-future blog post.</p>
<p>While the outcome is successful, the implementation of the move was&#8230; not a great success.  While nothing specific went horribly wrong, a number of little factors conspired to make the downtime last longer than expected.</p>
<p>First, we estimated the time required to disconnect, transport, and install our equipment very accurately.  However, we (I) drastically underestimated the amount of time required to cable the equipment at the equipment at the new location.  We use a partial mesh cluster topology, so it is not a simple matter of plugging each server into the closest switch; servers have, on average, 4-6 connections each, and some of those connections required quite a bit of cable routing.  As a result, cabling  took several hours more than expected and became the single most time-consuming task.</p>
<p>Second, our &#8220;placeholder pages&#8221; that were intended to be up during the move did work sometimes, but they were dependent on equipment at the new location that wound up needing to be reconfigured once the equipment from the old location arrived.  We did not spot the unresolved dependency that caused this during the planning stages.</p>
<p>Third, a couple of our new servers, which had been very thoroughly tested prior to the move, completely flipped out on us as soon as they were brought up in the live configuration.  They were quickly stabilized but their timing absolutely sucked.</p>
<p>After the move was completed and things started working again, we encountered a number of additional problems, most notably errors from our network edge proxies: Connection Refused, Host not responding, Host is Down, and Unknown Site.  Some of these were related to the new configuration, but several were related to previously unexercised (or undetected) bugs in our clustering software that have now been fixed.</p>
<p>Unfortunately, it is not really useful to say &#8220;We have learned (something) from this experience about what to do next time.&#8221;  That is because, having attempted what we did, only a complete idiot would do it again.</p>
<p>Should we ever need to move facilities in the future, no matter how long it takes or how much it costs, we will just build out the new facility in its entirety, move all the services between the two live facilities, and then burn down the old one for the insurance money.*</p>
<p>We have also renumbered into new IP addresses.  This is temporary, and we will renumber again once our network brings some additional carriers online, with the primary holdup being bureaucracy.  (The only reason the Internet hasn&#8217;t run out of IP addresses already is that the regional Internet registries make the application process so very tedious and time-consuming that it discourages all but the most determined applicants, like us.)  </p>
<p>At this point, although we&#8217;re still in the period where most people automatically assume most problems are move-related even though most aren&#8217;t, we do still have a few remaining issues we are looking into:</p>
<ul>
<li>We have scattered reports that DNS performance may not be satisfactory. Although there&#8217;s nothing specific to back that up, we are developing a way to instrument our DNS performance to measure speed and reliability, rather than blindly making changes we think might help and hoping for the best.  In the mean time, if you have any problems like this please contact us, especially if you can document them.
</li>
<li>Although it was not directly move-related, some people have encountered a unintended change in the default behavior of our network when Apache issues an internal redirect, causing their sites to fall back on the example.nfshost.com alias if, for example, someone enters a directory name without the trailing slash.  This change is actually the result of a bug fix and represents the correct behavior, but we understand that people find it annoying.  There is an .htaccess workaround for enforcing canonical site URLs in our Member Wiki, but we are working on allowing you to specify the canonical name for your site from our member interface, which will then be used in all such redirects.</li>
<li>The block of IP addresses that we are temporarily using is on a blacklist maintained by a secret cabal of British P2P users that apparently like to trade illegal files and don&#8217;t want to get caught.  They originally had the range blacklisted &#8220;a lot&#8221; because of former users of the same IP addresses, but switched it to &#8220;just a little bit&#8221; (rather than removing us) because they were apparently worried that NearlyFreeSpeech.NET was really just a front for the RIAA, but subsequently blacklisted us &#8220;a lot&#8221; again for &#8220;not showing any gratitude&#8221; about being incorrectly blacklisted only a little bit.  So we&#8217;re not just spies, we&#8217;re <em>ungrateful</em> spies.  Seriously, I couldn&#8217;t make this stuff up if I tried.  Fortunately, this has no effect at all unless:
<ol>
<li>you are a NearlyFreeSpeech.NET member,</li>
<li>you run a P2P blocking application called &#8220;Peer Guardian,&#8221;</li>
<li>you edit your site with FTP.</li>
</ol>
<p>If all those are the case, when you access your site with FTP you will receive a warning that you can bypass by clicking on it.  We have specific instructions available if you need them, but I think the handful of affected people have already contacted us.</p>
<p><strong>This has no effect on web access to your site</strong>, and will most likely not be an issue at all after we renumber our IP&#8217;s again.  If you have any concerns about it, please feel free to contact us.</li>
<li>The blog-to-email service we were using appears to have imploded.  We&#8217;ve selected a replacement, and we&#8217;ll be setting that up shortly, so email notification of NearlyFreeSpeech.NET news &#038; announcements will be possible soon.</li>
<li>One thing I noticed during the downtime was how liberating it was to have an open issue and the offsite status page that let us give out real-time updates.  I want to find some way for us to do that on an ongoing basis.  Something less &#8220;significant&#8221; than a blog post; almost like an internal members-only <a href="http://twitter.com/">Twitter</a> that we could update multiple times a day so you can find out what&#8217;s going on right now and what we&#8217;ve been working on lately to make your service better.  Provided we give you the tools to find what you want, we really can&#8217;t give you too much information.</li>
</ul>
<p>Beyond those things, everything seems to be back to normal.  We&#8217;ve still got a mountain of cleanup work, but give us a couple of weeks and we should be able to turn our focus back to the sort of new feature development that really differentiates NearlyFreeSpeech.NET from the mob of cookie cutter web hosts.</p>
<p>I want to take this opportunity to thank our members for all of their support, patience, and understanding during and after our move.  We are so grateful for all of you.  With that said, I don&#8217;t want to trivialize the downtime we experienced during the move (or the issues that led up to it).  We are very sorry to everyone about the downtime caused by our move, and for the issues of the past couple of months that led up to such drastic action.</p>
<p>Naturally we did receive a few fairly snarky messages about the move and downtime, but that&#8217;s human nature, and frankly we got fewer than we expected.  I&#8217;d like to offer a special apology to those few people for putting them in a position where they felt like that was their best recourse.</p>
<p>To close out the subject of the move, I would love to be able to state firmly, right here and now, &#8220;We will <strong>never</strong> do <strong>that</strong> again!&#8221; but the reality is that <em>never</em> is a really long time.  </p>
<p>What I <em>can</em> say is that as a result of this move, we&#8217;ve gone from having almost no control over our network to having almost complete control and we&#8217;re chiseling away at the &#8220;almost&#8221; even now.  We&#8217;re also now able to start building our own network, which will mean the ability to spread out to multiple locations.  At that point, moving no longer affects everything we own.  It just becomes a turn up in one location followed by a turn down in another.  All the while providing the high quality of service you expect from NearlyFreeSpeech.NET.</p>
<p>Thanks again!</p>
<p>*Note: That was a joke. Kids! When it comes to arson, environmentally damaging disposal of old computer equipment, and insurance fraud: just say &#8220;No!&#8221;</p>
<p>(Just a reminder: our blog is not a venue for member support and we can&#8217;t discuss issues specific to your service with you in public due to our Privacy Policy.  If you&#8217;re having a problem with your service, whether you think it&#8217;s move-related or not, please submit a <a href="https://members.nearlyfreespeech.net/support/request">secure support request</a> so we can help you out.)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2007/09/27/facility-move-post-mortem-analysis/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Planned downtime for Monday, September 10</title>
		<link>http://blog.nearlyfreespeech.net/2007/09/07/planned-downtime-for-monday-september-10/</link>
		<comments>http://blog.nearlyfreespeech.net/2007/09/07/planned-downtime-for-monday-september-10/#comments</comments>
		<pubDate>Fri, 07 Sep 2007 04:35:39 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/2007/09/07/planned-downtime-for-monday-september-10/</guid>
		<description><![CDATA[On Monday, September 10, 2007 at around noon Arizona time (3pm Eastern, 7pm UTC), we will temporarily shut down our entire network to complete a migration of our equipment to a new datacenter.  We anticipate that it will take four to eight hours to complete the move.  All our services will be offline [...]]]></description>
			<content:encoded><![CDATA[<p>On Monday, September 10, 2007 at around noon Arizona time (3pm Eastern, 7pm UTC), we will temporarily shut down our entire network to complete a migration of our equipment to a new datacenter.  We anticipate that it will take four to eight hours to complete the move.  All our services will be offline during that time.<br />
<span id="more-32"></span><br />
<strong>Q. Why are you moving your datacenter?</strong></p>
<p>We have been collocated with Limelight Networks since our inception in 2002.  While Limelight is an excellent network, and their newest datacenters are works of art, we have run into a number of issues that have conspired to push us in a different direction:</p>
<p>1) Limelight recently completed a successful IPO based on their highly successful and incredibly awesome CDN service, and as such they are now almost exclusively focused on that service.  In fact, they no longer offer collocation as a product for companies like us.  We do not want to find ourself in a position where we cannot expand because the facilities are no longer available.</p>
<p>2) At Limelight, they run the datacenter and they are our only Internet carrier.  While that&#8217;s very convenient for us, it&#8217;s putting all our eggs in one basket, which is not good for redundancy.  We need (and have obtained) a carrier-neutral facility that will enable us to connect directly to as many carriers as we like, improving our overall reliability and also empowering us to do things like cherry-pick carriers that will improve overseas connectivity.</p>
<p>3) With multiple carriers, we will be able to request our own IP assignments directly from ARIN, which will remove our exposure to certain types of problems caused by routing issues on other networks.</p>
<p>4) We will be behind dedicated hot-failover routers and firewalls under our control, which will improve our reliability and enable us to react much more quickly to DDOS attacks, eliminating an entire class of such attacks through proactive filtering, and mitigating the rest much more quickly because we will be able to do it ourselves without depending on any third party.</p>
<p>5) We will be retiring some legacy equipment that is constraining our network performance but is too critical and too deeply wired in to address any other way.</p>
<p>6) We will be deploying an infrastructure capable of scaling to at least 10Gbps.</p>
<p>As you can see, this move represents us stepping up and taking control over a lot of areas where we have previously been dependent on others.</p>
<p><strong>Q. Why are you moving your datacenter <em>now</em>?</strong></p>
<p>In July and August, we experienced an unacceptable number of reliability issues.  While the causes of these issues were almost exclusively not &#8220;our fault&#8221; and factors beyond our control (DDOS attacks, routing problems at relatively distant carriers, etc.), at some point it <em>becomes</em> our fault for not taking control of those factors, which is what this move is doing for us. </p>
<p><strong>Q. Why will the downtime be so long?</strong></p>
<p>While we can&#8217;t say for certain how long it will take, here are some of the reasons we estimate it will take so long:</p>
<p>1) It&#8217;s a lot of equipment.</p>
<p>2) We are actually consolidating two datacenters into one.</p>
<p>3) Virtually all of our production infrastructure equipment (routers &#038; switches) will have to be completely reconfigured on the fly.</p>
<p>4) We have to make two trips to ensure that we don&#8217;t have every copy of our members&#8217; data in the same truck at the same time.</p>
<p>5) It&#8217;s much better to set a realistic, conservative timeline than try to rush things and wind up making a critical mistake.</p>
<p><strong>Q. Why didn&#8217;t you find a way to do this without downtime?</strong></p>
<p>Anticipating the need to expand, we spent several months trying to plan a way to do a move without affecting service.  In fact, we have already moved roughly a third of our equipment with relatively little impact.  (Example: our services are supposed to all be N+2 redundant, but right now all the +2&#8217;s are at the new building, leaving us not a lot of margin for error.)  </p>
<p>In the end, a complete zero-downtime move proved impossible.  At some point, your site&#8217;s data is on a disk array, and that array can only be in one place at a time.  The same for your MySQL process.  We tried MySQL replication between facilities on our own master database, and we were disappointed with the results (it blew up at least once a day).  Similarly, the amount of data associated with many sites is just too much and changes too fast to keep it synchronized between two locations.  We have experimented with all sorts of VPN-type solutions, hoping to facilitate &#8220;sneaking&#8221; equipment from one location to another, and they just aren&#8217;t reliable enough to depend on for the length of time it would take to migrate that much data.</p>
<p>Still, such a move is theoretically possible with enough planning and preparation.  However, given the issues we&#8217;ve seen in the past few months, and our current vulnerabilities to those same sorts of issues reoccuring, it becomes a risk assessment question: will the unknown, unplanned downtime caused by external forces during the time it takes us to devise a zero-downtime move be worse for our members than a single, known, scheduled downtime?  We just can&#8217;t take the chance that the answer is yes.</p>
<p><strong>Q. What are you doing to minimize the effects of the downtime?</strong></p>
<p>We have been migrating services and equipment to the new facility as much as we can without causing disruption, and will continue to do so right up until the last minute.  Everything we can move <em>before</em> is something we don&#8217;t have to move <em>during</em>.</p>
<p>Also, rather than leave your web site completely unresponsive, we are going to have a &#8220;maintenance page&#8221; served for all requests indicating to people that your site is not gone or shut down, merely being serviced for a few hours, and asking them to check back later.</p>
<p><strong>Q. Will you be changing your IP addresses during the move?</strong></p>
<p>Yes.  This move will entail a complete IP renumbering on our part. Actually, after it is done, we will have to renumber <em>again</em> a month or so later to satisfy some obscure bureaucratic requirments.  </p>
<p>We will be leaving a handful of equipment at the old location to preserve the old IP addresses and catch people with hardcoded third-party DNS or domain registrations.  Once the second renumber is complete, we will start contacting people using the old addresses to let them know what changes they need to make.</p>
<p>For those people who have their domain registrations and DNS with us, all your IP information will be updated automatically both times with no action required on your part.</p>
<p><strong>Q. How can I stay up-to-date on the downtime?</strong></p>
<p>As soon as the downtime arrives, we will begin posting updates on our <a href="http://status.nearlyfreespeech.net/">offsite status page</a> with our trademark transparency.  We will keep it updated throughout the move, particularly if anything unforeseen happens.</p>
<p><strong>Q. How are you making sure everyone is aware of this?</strong></p>
<p>We know not everyone follows our blog, and that the third-party blog-to-email service we were using recently imploded.  For these reasons, and because of the scope of the move, we will be taking the unusual step of sending out a mass-email to all of our members making them aware of the planned downtime.</p>
<p><strong>Q. Are you sorry you have to do this?</strong></p>
<p>Yes, we are really, really sorry we have to do this, and we hope you will agree that the long-term benefits overwhelm the short-term pain of this downtime.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2007/09/07/planned-downtime-for-monday-september-10/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Offsite Network Status Page</title>
		<link>http://blog.nearlyfreespeech.net/2007/02/19/offsite-network-status-page/</link>
		<comments>http://blog.nearlyfreespeech.net/2007/02/19/offsite-network-status-page/#comments</comments>
		<pubDate>Mon, 19 Feb 2007 06:24:19 +0000</pubDate>
		<dc:creator>jdw</dc:creator>
		
		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[News &amp; Announcements]]></category>

		<category><![CDATA[Updates &amp; Upgrades]]></category>

		<guid isPermaLink="false">http://blog.nearlyfreespeech.net/2007/02/19/offsite-network-status-page/</guid>
		<description><![CDATA[We know that when there is a problem with our services, one of the most important things for us to do is communicate effectively with you about what the problem is and when we expect it to be fixed.  Nothing sucks more than being left in the dark.
Ordinarily, we use System Problem Reports on [...]]]></description>
			<content:encoded><![CDATA[<p>We know that when there is a problem with our services, one of the most important things for us to do is communicate effectively with you about what the problem is and when we expect it to be fixed.  Nothing sucks more than being left in the dark.</p>
<p>Ordinarily, we use System Problem Reports on the Support panel in our member interface to let you know about problems with the service. In most cases, this works well.</p>
<p>However, certain categories of outages, like upstream network failures, can render us unable to post such reports, or leave you unable to read them until after the problem is resolved. While such events should be rare, that&#8217;s the most important time for us to be letting you know what&#8217;s going on.</p>
<p>To that end, we&#8217;ve created an <a href="http://status.nearlyfreespeech.net/">offsite network status page</a> at <a href="http://status.nearlyfreespeech.net/">http://status.nearlyfreespeech.net/</a>. This site runs on one of our servers located in Dallas. That server doesn&#8217;t depend on our Phoenix hosting cluster at all, and will continue to operate even if everything else is offline.<br />
<span id="more-23"></span><br />
The Offsite Network Status Page collects information from three sources:</p>
<p>1) We can post information directly to that site in the event of a serious problem.</p>
<p>2) It attempts to download a list of System Problem Reports every few minutes.</p>
<p>3) It attempts to mirror recent blog posts every few minutes.</p>
<p>If either of the latter two automatic updates fail, that might indicate a problem, so the site will display a message to that effect, even if we haven&#8217;t been able to post any information manually yet.</p>
<p>This is a tool that we hope we don&#8217;t find ourselves using. However, it definitely falls into the category of things that is better to have and not need than to need and not have.  In that spirit, we hope that you will add status.nearlyfreespeech.net as the bookmark you hope you never use.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2007/02/19/offsite-network-status-page/feed/</wfw:commentRss>
		</item>
		<item>
		<title>So many categories</title>
		<link>http://blog.nearlyfreespeech.net/2006/11/15/so-many-categories/</link>
		<comments>http://blog.nearlyfreespeech.net/2006/11/15/so-many-categories/#comments</comments>
		<pubDate>Wed, 15 Nov 2006 08:13:13 +0000</pubDate>
		<dc:creator>flux</dc:creator>
		
		<category><![CDATA[Humor]]></category>

		<category><![CDATA[Network Status]]></category>

		<category><![CDATA[Policy Changes]]></category>

		<category><![CDATA[Politics &amp; Legal]]></category>

		<guid isPermaLink="false">/blog/2006/11/15/just-another-test/</guid>
		<description><![CDATA[So few posts. Until we&#8217;ve got posts in every category we&#8217;ve posted this one to all of them, so you can see what&#8217;s in store and subscribe to the feeds you want before you miss anything.
If this is the only post you see in a category, it just means the category hasn&#8217;t been assigned any [...]]]></description>
			<content:encoded><![CDATA[<p>So few posts. Until we&#8217;ve got posts in every category we&#8217;ve posted this one to all of them, so you can see what&#8217;s in store and subscribe to the feeds you want before you miss anything.</p>
<p>If this is the only post you see in a category, it just means the category hasn&#8217;t been assigned any real posts yet.</p>
<p><!-- b6hlkO3u9r2KGYF --></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.nearlyfreespeech.net/2006/11/15/so-many-categories/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
