All Confirmation Bias, All The Time

December 1st, 2010

Routing analog input to HDMI output under Linux

Fedora gets close on my audio machine, but it lacks the actual hardware to pass analog input to HDMI output in the audio chip itself– I suspect the route doesn’t exist.

I set the GNOME sound settings (FC14 – pulseaudio based) to Digital Output + Analog Input. Then run (make a script you can click, if you want) this gstreamer pipeline:

gst-launch pulsesrc ! pulsesink

November 16th, 2010

Modern Mac Java Development

The latest releases of XCode don’t ship with Java anymore. You’ll need the Java Developer package from Apple.

(For me, this means downloading Eclipse, Java Developer, Tomcat, and the Java Web tools for Eclipse.)

October 27th, 2010

A scaling, fixed background in CSS

First, right after the body tag:

<img src='background.jpg' alt='' id='fixedbackground'>

Then in the CSS:

#fixedbackground { position: fixed; /* The body will scroll over the image */ top: 0; left: 0; width: 100%; /* Scales to width of the body */ z-index: -1; /* Go BEHIND the body */ } body { background-color: #000; /* Choose a color that matches the bottom edge of the background image. Make the image match this, too, of course. */ }

An example: Leopard Creek Timber Frame

October 21st, 2010

Thai Pizza

For the sauce:

One underripe mango, coarsely diced (though ripe would work fine too, just be a lot sweeter)
One quarter of an onion, chopped fine.
Half a cup of chopped cilantro
Two red ripe hot peppers, chopped fine and discarding the seeds (I used my favorite, fresno peppers, which are about the size of a jalapeño, but one could adjust all the way from a little bit of red bell to habanero peppers)
A dash of fish sauce
A few kaffir lime leaves, I used a pinch of the pickled canned and shredded ones

Sautee the onion, then add the peppers, then cilantro, lime leaves and mango. Add the dash of fish sauce, and cook until dry.

On top of the pizza, I put onion, mushroom, chicken marinated in soy sauce and ginger, and chunks of red bell pepper. Top with mozzarella, bake at 550ºF until the cheese and crust are crisp.

I used an extra-high-protein flour, super high in gluten and stretched super thin (so much gluten that you can’t actually ROLL the dough.) It was awesome.

October 2nd, 2010

RFC: A proposal for asynchronous exception handling

At the moment, if you have an EventEmitter, and it has an unhandled exception, it terminates the node process with an error message. This makes dealing with exceptions in nested event handlers very difficult, since there’s no containing scope other than the closure created for the callback.

I propose keeping track of the ‘current’ EventEmitter in each event handler, snagging a reference to the parent event handler (which should be available on the stack at the time the EventEmitter is created), and emitting an error event on the parent EventEmitter with the exception, passing it ‘upward’, yet in an asynchronous fashion.

Comments?

September 19th, 2010

1152

I just had a most detailed dream, hanging out with a woman from New Zealand, helped her into her wheelchair because she was tired, and had this cute upper lip hair that was eye-catching. I don’t usually dream with lots of details nor remember it clearly after I wake. I kinda like it when I do.

August 25th, 2010

Transitioning from a case-insensitive to a case-sensitive server

Converting sites that have been hosted on a Windows server is often frustrating, as IIS allows files to be accessed with any case in their filename. Here’s a simple solution for a site made of static files, using PHP and an Apache 404 handler:

In 404-case-insensitive.php

<?php

/* Copyright 2010 The Internet Company LLC
 *
 * May be copied under the terms of the MIT software license.
 */

	$directory = dirname($_SERVER['REQUEST_URI']);
	$base = basename($_SERVER['REQUEST_URI']);
	if($directory == '/') $directory = '';

	$potential = glob($_SERVER['DOCUMENT_ROOT'].$directory."/*");

	foreach($potential as $e) {
		$e = basename($e);
		if(strtolower($e) == strtolower($base)) {
			header("Location: $directory/$e");
			exit(0);
		}
	}

	Header("HTTP/1.1 404 File Not Found");
	echo("Page not found.");

?>

And in .htaccess

ErrorDocument 404 /404-case-insensitive.php

August 24th, 2010

Banana and gjetost quesadilla

You have no idea how weird my diet is until you see me eat something like this.

Fry one sliced banana in coconut oil until browned and soft.

Put several strips of gjetost cheese on a tortilla, and melt. Let the tortilla crisp slightly.

Add the bananas and try not to moan in ecstasy while eating.

August 24th, 2010

Ceviches

Mexican-style ceviche

Two fillets of tilapia, cut into 1 cm cubes.

50 ml lime juice

50 ml lemon juice

50 ml rice vinegar

One roasted fresno pepper or ripe jalapeño pepper, chopped finely.

50 g of finely chopped onion.

2 g salt

Let the fish marinate in the rest of the ingredients. Serve and enjoy

Dill trout ceviche

Three tiny or two small trout fillets, cut into small pieces

50 g finely chopped onion

50 ml lemon juice

50 ml balsamic vinegar

5g dill seeds

5g dill weed, cut

2g salt

Marinate the fish in the rest, serve and enjoy.

August 21st, 2010

Node.js Streams

Making an object that speaks the node.js streams interface is surprisingly difficult.

There’s a fair number more interfaces than meet the eye:

You have the interplay of stream.readable and stream.resume()

You have the fact that streams speak in both Buffers and Strings.

sys.pump doesn’t relay errors, so you have to attach handlers to the right objects – I’m not sure if that one’s a problem yet or not.

August 20th, 2010

Stewed eggplant with sweet rice (vegan!)

Eggplant are in season. We’re eating them stewed.

Chop two large eggplant into half inch cubes.

Chop a large onion. Fry it in a generous portion of olive oil.

Add herbs. Tonight’s: oregano, a head of garlic, a bit of paprika. Last night’s: a touch of cinnamon, paprika, oregano, dill. Fry them into the onion, then add the eggplant. Let the eggplant brown slightly, then add two large cans of tomatoes, or a couple pounds of fresh tomatoes. Add a spoonful of sugar, possibly some balsamic vinegar.

Let this cook down. It’ll stick slightly. If so, it’s caramelizing, and that’s just what you want. Don’t let it stick too badly, but it should sizzle when you stir it down to the bottom of the pot.

Let it cook until it’s a thick paste. It won’t be smooth, but it’ll be a really rich spread.

Cook rice, I used a short-grain white rice.

Rehydrate some raisins. Drain them.

Fry a half an onion in a frying pan. Let it start to caramelize and brown. Add a half teaspoon of tumeric and a teaspoon of paprika.

Add a tablespoon of sugar. Let it caramelize slightly. Add the rice, the raisins, and a tablespoon of poppy seeds. Salt just a little.

Serve side by side, let the flavors contrast. The intense richness with the velvety texture of the eggplant, with the sweet chewiness of the rice and raisins. The bright yellow-orange of the rice with the deep red of the eggplant and tomatoes.

August 18th, 2010

Statistics from mail filters

Entities: connections, messages, sending IPs, destination email addresses and domains, sending email addresses and domains

RBL hits per entity
Minimum, maximum, average, mean, deviation
Bad RCPTs per entity
Total RCPTs per entity

I’m sure there’s more, this post will be edited as I think of them.

You can detect VERP senders by having a high correlation of sending domain and receiver email address.

You can detect dictionary attacks by having a high correlation of sending IP, domain or receiver email address and receiving domain.

August 15th, 2010

Mail filter actions

Most mail filters get something major wrong. Most use an ordered list of actions, but limited to narrow scopes, in the order that they occur in SMTP: first check the sender, then the receivers, then check the content.

Mail filter plugins should be run first in order of what phase of processing they need to be in, but evaluated in order of finality of their decision. Check RBLs that outright block hosts first, then ones that are used to decide to quarantine. Then check for viruses, things that will get a message outright rejected or quarantined, then check spam filters.

Execute in parallel, in fact. Many checks involve waiting on networks, disks and other resources, so there’s no reason not to set several actions off at once and wait for completion.

There are several sets of actions that happen: responses to the SMTP client that’s sending us the message, and internal processing of the message, logs, notices to receivers about exceptional events. Once a message is accepted at SMTP time, we no longer have the option to bounce it: if it disappears into the aether, it had better really be junk, because nobody will know what happened to it. Each stream of actions is independent: rules will continue to be evaluated until all specified actions have been satisfied. (smtp, receiver, message, system)

The actions one might want: tempfail, accept, reject, notify, drop, log, record, add-header, add-footer, filter-message, redirect, quarantine, and continue.

The redirect and quarantine actions merely change the destination of the message, and don’t stop processing.

I figure group them numerically, with the highest priority overriding any lower priorities. Let groups be ORed together. Stop when you have a definite answer.

There are two kinds of actions: on`` actions react to the conditions of the group -- if a whitelist matches or not, if a spamfilter returns 'spam', 'not spam' or 'unsure'. ``on .. when actions are triggered when the condition of the when clause matches as well, forming a primitive boolean AND while still respecting an idea of priorities.

`

defaults { on error tempfail all; on success continue all; on any log all; }

group virus { checkcontent clamd; on match reject all, log system, log receiver; }

group user-whitelist { check whitelist; on match accept all; on match when virus match notify receiver; }

group { checkrbl b.barracudacentral.com; checkrbl b.spamcop.org; on match reject all, log system; }

group { checkcontent lmtp:///tmp/spamd.sock; checkcontent blacklistedwords; on spam accept smtp, quarantine message; }

finally { on any accept all; } `

A message comes in from 127.0.0.2: RBLs come up saying to block it. Because no higher rule will accept it, it gets rejected before DATA. The connection attempt is logged to the user, but no message is accepted at all.

A virus-bearing message comes in from 1.2.3.4, from a white-listed sender: RBLs don’t reject it, not being a listed IP. The SMTP connection gets as far as DATA, and the virus scanner is fired off, and returns a ‘virus’ response. The message is rejected on the SMTP side, a notice is sent to the receiver with the details. The whitelist is lower priority than virus scanner, so the message is still rejected. However, since there is also an action aimed at the receiver, that event fires and a notice is sent to the receiver of the message. At this point, evaluation stops since there are no more actions that could happen.

Thoughts and suggestions are welcome.

August 15th, 2010

Mail filter extensibility

The biggest internal requirement that I have for a new mail filter setup is extensibility. The actual decision as to what is and is not spam needs to be left up to modules.

I hesitate to write a system that is a suite of full ACLs, like Exim or Postfix’s access controls. Postfix’s are barely flexible enough to work at all, and Exim’s are so overwhelming and yet limited that you have to be a programmer to write a system that’s not going to break or lose mail, and a clever programmer at that.

Every technique for filtering has a natural place in the flow of things: RBLs are early, at HELO or RCPT TO time; Learning filtering must come after DATA has been received, and could either stream or receive the message as a single dump. Filtering at HELO time should be rare: you can’t check a per-destination whitelist that early. You have to wait for RCPT TO, and in fact, many senders may retry again and again and again if you reject at HELO instead of RCPT TO.

So each plugin receives some part of the SMTP-time data: early ones get IPs and connection-related information, and later ones get the full message data.

Plugins essentially distill their input into a status: “good”, “bad”, “not sure”

August 15th, 2010

Mail filter requirements

It’s time to update the spam filter at The Internet Company again.

I’m getting a lot of feedback from users of both my system and another I administer that they need several different things in a spam filter.

My users need:

The ability to retrieve a filtered message. Even if it’s rejected, in most cases, being able to fetch it from a quarantine is necessary. Some things can be hard-rejects, like virus-infected mails and things from very obvious spam sources, but the grey area needs to be very wide.
Some degree of control over what techniques are used: degree of quarantining, whether blacklists are used, and whether they reject or merely quarantine mail
Whitelisting, both by individual user and by domain.
Blacklisting, both by individual user and by domain, including whether to quarantine or reject.
Ability to retrain a learning filter while still using a POP3 mail client. This means a ‘signature’ with saved fulltext of the message like DSPAM or CRM114’s mailreaver do, so mail can be forwarded back altered by mail clients with no interest in preserving formatting like Microsoft Outlook, or so that there can be a web interface to retrain.

The overall themes here are ‘user control’ and ‘ability to retrieve a missed message’. Spam filters can be highly accurate in practice, with well-trained users who understand how the filters work, but most aren’t accurate enough or careful enough while training to be able to reject mail based on a learning filter alone. Business users could lose a thousand dollars or more on certain emails from previously unknown senders, so the ability to review and recover from the filter’s decisions is very important.