Posts

Openfiler project

As part of the lab rebuild, I am setting up an Openfiler box. Some features I am using: iSCSI block-level storage with path redundancy FC block-level storage Block-level replication (potentially have a 2nd host) As I've been setting it up, I'm realizing that it uses LVM for storage which is kinda nice as it dovetails into what I've learned from the clusters at work.  Also realized that fiber channel has way more curb appeal than iSCSI.  I've not had a chance to integrate the FC switch yet, but that's on the list.  What I have done is document (thanks to the internet and some testing) the steps involved in setting up/giving new access to the target and new servers.  For the price, I'd say you can't go wrong with this setup for a home lab. Openfiler has a 4-port FC card ($80 per card) Each ESXi server has a 2-port FC card ($40/per card) FC cables from monoprice ($10/each) Now, the motherboards I am using in the ESXi servers do not have PCI-X s...

Openfiler - Errno 104 (conary updateall)

If anyone is getting this error: /usr/lib64/python2.6/socket.py:381 error: [Errno 104] Connection reset by peer It is probably caused by a proxy or transparent proxy blocking access.  In my case it was my Astaro firewall - I had to add a 'Web Filtering/Exceptions' rule for the Openfiler host.  I set it to allow everything, probably safe - no browsing will be done, just the conary updates. I also found this bug on the Rpath site (even signed up to report!) from 2009, and the dev provided the proxy as a point of interest.  (  https://issues.rpath.com/browse/CNY-1958  ) Once the fix was in place, I decided to run the Update utility from the web GUI (which had also failed before), and it provided me with errors for about 5% of the packages to update, although the rest installed correctly.

Service accounts & domain admin privileges

Over the last few weeks I've had a good couple of lessons around service accounts and domain admin privileges (and who should have them).  What came to mind was a kind of cascading failure caused by not following best practices. Management team finally authorized the changes the windows admin had asked for - the removal of all 'regular' users from the Domain Admins group along with the creation of 'admin' accounts for people that required them. Users were removed from Domain Admins group.  Windows admin did not communicate this to anyone.  Management did not communicate to users that this was going to happen. Random things began to break.  Small in-house-programmed websites stopped working, workflows were disrupted, ticket queue built up, etc. After spending a lot of hours trying to figure out why these things were breaking, someone happened to mention that 'oh, admin removed domain admin privs for everyone'. Light bulb. Confirmed that each and every...

Apache clerestory

As I get going with all this, it's becoming clear I should have a dedicated Apache box.  The wiki box is the obvious example, but might as well start fresh so there's no wiki nonsense buried in there.  Will also give me a chance to document the migration of sites from one server to another. Further, because we're hardcore here, we'll be doing an Apache CLOISTER.  I mean clerestory.  Cluster. From my day job I'm reasonably familiar with clusters, and I'll transfer my wiki info from there to the PTC wiki (yes this is okayed by them).  Obviously info will be sterilized, and frankly a lot will change since I'll be running through the entire process and making corrections/addendums, so there. Clusters.  Uptime.  Fo sho. I am not making this up. The more you know!

Reasons why the wiki is down:

Software issue (service crashed). Hardware issue (like CPU broken). Internet is out. Power is out. Meteor. Update:  In this case, option 3.  Out for an hour.  I should really call Bell and see what the heck.

Core i7 failure

Well, there's a first time for everything.  Actually had a CPU go bad on me.  Thought it was mobo for quite some time, didn't even consider CPU failure as an option.  Thank the Lord I had the spare lab box with another Core i7.  Little upgrade I guess...i7-920 to -950.  However, the CPU has a code that indicates 2008 manufacture, so might be out of warranty. I'll be calling Intel tomorrow, so praying it'll be covered.  If not...no cheap to replace! For anyone interested, the symptoms of the failed CPU are just this: System will no longer post, or show any video.  For all appearances it looks like a dead motherboard, or perhaps dead PSU. update: Ah, was purchased in July 2009, so should be okay. Constructive update: Decided to post my troubleshooting process just in case someone isn't sure: System BSOD'd. I noted the BSOD error and codes via photograph, just in case they would help.  No drivers were mentioned in the BSOD, so chan...

Synology NFS 'access denied' and resolution

Fun time figuring this out, I was being bad and not documenting, but here's what I recall: Kept getting 'access denied by server while mounting' errors when using this command: mount -t nfs 10.0.0.14:/volume1/mysql_backup /srv/backup Checked and re-checked the Synology settings to no avail.  Thought it was something to do with root squash - was not. Correct settings should be correct IP address, RW, No Mapping, Enable Async SSH'd in to the Synology and after some messing about with /etc/exports, I set up tail -f /var/log/messages Took me a while to notice it, but the IP it was registering was the Astaro gateway IP - the Synology and my PC are on different subnets! Set the NFS rule to '*' and started working immediately. Firewalls make it easy to overlook simple things.  I imagine there is some sort of fancy NAT rule for the NFS traffic that would allow specific IPs, but seeing as how I'm technically behind two firewalls and this is a lab, the allow ...