High Performance Libcurl Tips
At SEOmoz, we’ve been able to make use of many great open source packages. One that’s particularly important to our crawling infrastructure is libcurl, which abstracts the logic behind HTTP (and many...
View ArticleHow to share navigation across multiple applications
If you’re an SEOmoz user, you may have noticed that our layout and navigation recently received a major overhaul. Updating the layout wasn’t nearly as simple as you might expect. I was the lead...
View ArticleLaunching and Deploying Instances with Boto and Fabric
A problem that has become kind of an issue for both a personal project and a work project is a matter of deploying new instances, and then maintaining them. There are a number of packages available for...
View ArticleHow to Contribute to an Open Source Project
I contribute to open source on a fairly regular basis. Besides the fact that I enjoy giving back to the community, it also helps make our applications more maintainable: if I can get the bug fix or new...
View ArticleShovel — Rake for Python
I love Python. And while I’m becoming increasingly less religious about which programming language I use, I often find myself gravitating back to Python. That said, we migrated a certain repository of...
View ArticleA look under the hood of Feed Authority
Recently, we built Fresh Web Explorer a large scale feed crawler and search engine that allows inbound marketers to do all sorts of wonderful marketing related things. Dan has already written about...
View ArticleDragnet: Content Extraction from Diverse Feature Sets
In this post we describe Dragnet, our approach to content extraction. This continues our series of deep dives on individual pieces of Fresh Web Explorer (see a description of Fresh Web’s overall...
View ArticleMozscape’s Leap From C++ to Python
Hey Moz fans, Brad here! I’m the technical lead in charge of the Mozscape API, and I have some exciting news to share with you today. Supercharged Mozscape! We recently released a brand new, shiny...
View ArticleTCO: The Nerd’s LOB
Welcome to the first in a series of blog posts we’re planning around Moz’s adventures in cloud computing. You may have heard of the Seahawks’ “Legion of Boom” (LOB). It’s the nickname that the...
View ArticleNear-Duplicate Detection
Duplication of content on the Internet is inevitable, whether it be intentional or accidental, malicious or coincidental. There are a number of reasons that it’s important to identify duplication, and...
View Article
More Pages to Explore .....