Very interesting article on software rasterization. It describes many optimizations that are possible on modern processors, whose vector (SIMD) instructions perform multiple operations in parallel.