rgb-cln

Commit Graph

Author	SHA1	Message	Date
Rusty Russell	2aad3ffcf8	common: tal_dup_talarr() helper. This is a common thing to do, so create a macro. Unfortunately, it still needs the type arg, because the parameter may be const, and the return cannot be, and C doesn't have a general "(-const)" cast. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2020-02-27 14:16:16 +10:30
Vasil Dimov	89ceb273f5	wire: remove towire_double() Before this patch we used to send `double`s over the wire by just copying them. This is not portable because the internal representation of a `double` is implementation specific. Instead of this, multiply any floating-point numbers that come from the outside (e.g. JSONs) by 1 million and round them to integers when handling them. * Introduce a new param_millionths() that expects a floating-point number and returns it multiplied by 1000000 as an integer. * Replace param_double() and param_percent() with param_millionths() * Previously the riskfactor would be allowed to be negative, which must have been unintentional. This patch changes that to require a non-negative number. Changelog-None	2020-02-27 09:07:04 +10:30
Rusty Russell	7ab5c424b6	gossipd: provide (stripped) channel_update when resolving a channel. I hadn't realized that lightningd asks gossipd every time we forward a payment. But I'm going to abuse it here to get the latest channel_update, otherwise (as lightningd takes over error message generation) lightningd needs to do an async request at various painful points. So have gossipd tell us the latest update (stripped so compatible with the strange in-onion-error format). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2020-02-25 11:12:12 +10:30
Rusty Russell	11dc1b341c	gossipd: hand all candidates up to lightningd to select routeboost. This lets us do more flexible filtering in the next patch. But it also keeps some weird logic out of gossipd. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2020-01-04 08:07:22 +08:00
Rusty Russell	5a95e9f29a	gossipd: remove chainparams local var. We have a global, let's use it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-20 20:41:53 +01:00
Rusty Russell	eed654f684	connectd, gossipd: use per-peer logging. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-18 04:50:22 +00:00
Rusty Russell	5a8677edc6	gossipd: add txout_failure when a close is seen. This prevents a gratuitous lookup if we get a late channel_announce, but even better, it suppresses the "bad gossip" messages in case of a late channel_update, which have plagued Travis (especially since we got aggressive in pushing our own updates). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-07 03:50:53 +00:00
Rusty Russell	7f45e55d84	gossipd: set the push marker for our own messages. This is a better fix than doing it manually, which turned out to do it in the wrong order (node_announcement followed by channel_announcement) anyway. Should fix many "Bad gossip" messages. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-04 17:50:58 +01:00
Rusty Russell	034ed1711c	gossipd: fix memleak when getnodes has no nodes. In this case, node_arr is NULL. Triggered by the next test. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-14 18:58:44 -05:00
Rusty Russell	ca53c1b699	gossipd: push our own gossip messages harder. I had a report of a 0.7.2 user whose node hadn't appeared on 1ml. Their node_announcement wasn't visible to my node, either. I suspect this is a consequence of recent version reducing the amount of gossip they send, as well as large nodes increasingly turning off gossip altogether from some peers (as we do). We should ignore timestamp filters for our own channels: the easiest way to do this is to push them out directly from gossipd (other messages are sent via the store). We change channeld to wrap the local channel_announcements: previously we just handed it to gossipd as for any other gossip message we received from our peer. Now gossipd knows to push it out, as it's local. This interferes with the logic in tests/test_misc.py::test_htlc_send_timeout which expects the node_announcement message last, so we generalize that too. [ Thanks to @trueptolmy for bugfix! ] Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-14 15:00:37 -05:00
Rusty Russell	bd55f6d940	common/features: only support a single feature bitset. This is mainly an internal-only change, especially since we don't offer any globalfeatures. However, LND (as of next release) will offer global features, and also expect option_static_remotekey to be a global feature. So we send our (merged) feature bitset as both global and local in init, and fold those bitsets together when we get an init msg. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-11 02:52:04 +00:00
Rusty Russell	48f0362eae	seeker: handle non-synced state internally. We weren't supposed to do any gossiping until we were synced (and thus knew blockheight), but our seeker_check() didn't wait for it! Move the waiting all into seeker.c, so it can handle it all consistently. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	d2a5f056a8	gossipd: restore dev-suppress-gossip functionality. Don't start new peers, and don't check on existing peers. This should get rid of most gossip. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	7d207c50fa	gossipd: remove some spammy debug messages. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	a1644c1b6e	seeker: start doing a channel probe if we see unknown node_announcement msgs. It usually means we're missing something, but there's no way to ask what. Simply start a broad scid probe. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	82a5efa932	gossipd: start streaming gossip from last gossip timestamp minus 10 minutes. We assume that the time for gossip propagation is < 10 minutes, so by going back that far from last gossip we won't miss anything. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	70e88b0dfb	gossipd: have seeker control which peers gossip, reduce to 3 and rotate. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	869b5e40b5	gossipd: simplify seeker state machine. We eliminate the "need peer" states and instead check if the random_peer_softref has been cleared. We can also unify our restart handlers for all these cases; even the probe_scids case, by giving gossip credit for the scids as they come in (at a discount, since scids are 8 bytes vs the ~200 bytes for normal gossip messages). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	4b13c92802	seeker: use hash table for unknown short_channel_ids. Instead of a linear array which is fairly inefficient if it turns out we know nothing at all. We remove the gossip_missing() call by changing the api to remove_unknown_scid() to include a flag as to whether the scid turned out to be real or not. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	521c7f7121	seeker: take over gossip control. The seeker starts by asking a peer (the first peer!) for all gossip since a minute before the modified time of the gossip store. This algorithm is enhanced in successive patches. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	55323ec385	gossipd: move gossip seeking routines into new file seeker.c Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	79ca9bf998	gossipd: use per-peer information to make messages clearer. We can (usually) indicate what peer caused the bad gossip error. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	0091300ee3	gossipd: track what peer gave us gossip msgs so we can credit it. Since we have to validate, there can be a delay (and peer might vanish) between receiving the gossip and actually confirming it, hence the use of softref. We will use this information to check that the peers are making progress as we start asking them for specific information. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	296868daf4	gossipd: have gossip_store_load() return a timestamp. This is the modified-time of the file. We have to store it internally since we overwrite the gossip file with compaction on startup. This means the "are we behind on gossip?" heuristic is no longer inside gossip_store.c, which is cleaner. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	d75302deba	gossipd: random_peer() selector. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	33c658ecfb	gossipd: advertize all our features in node_announcement. This preempts the acceptance of https://github.com/lightningnetwork/lightning-rfc/pull/666 but it's clear that feature bits are going to be distinct, so this is safe to do anyway. See https://github.com/lightningnetwork/lightning-rfc/pull/680 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-07 05:51:05 +00:00
Christian Decker	ef7a63d8f8	elements: Move from a global is_elements to a global chainparams We now have a pointer to chainparams, that fails valgrind if we do anything chain-specific before setting it. Suggested-by: Rusty Russell <@rustyrussell>	2019-10-03 04:32:57 +00:00
willcl-ark	8d4203e9a6	[dev-suppress-gossip] - Set new peers to GOSSIP_NONE with flag enabled Signed-off-by: willcl-ark <will8clark@gmail.com>	2019-10-03 04:13:55 +00:00
Rusty Russell	aab9e9f010	gossipd: remove internal dev helpers for queries. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-30 07:08:07 +00:00
Rusty Russell	c07dff21dc	gossipd: generalize query_channel_range to use a callback. This means we'll be able to call it for internal reasons, not just dev testing as now. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-30 07:08:07 +00:00
Rusty Russell	4bf0bc1f28	gossipd: age txout_failures map. We do this by keeping a current and an old map, and moving the current to old every hour or 10,000 entries. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-27 02:32:53 +00:00
Rusty Russell	d24c850899	gossipd: restore a flag for fast pruning I was seeing some accidental pruning under load / Travis, and in particular we stopped accepting channel_updates because they were 103 seconds old. But making it too long makes the prune test untenable, so restore a separate flag that this test can use. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-27 00:01:34 +00:00
Rusty Russell	2b47922ea5	gossipd: move query functions into their own file. The only real change is dump_gossip() used to call maybe_create_next_scid_reply(), but now I've simply renamed that to maybe_send_query_responses() and we call it directly. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
Rusty Russell	38124ec287	gossipd: don't ask peers for gossip until we're synced with bitcoind. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
Rusty Russell	fd2d74aa9b	gossipd: defer asking about txouts until we're synced or they're 6 deep. The first one means we don't discard channels just because we're not synced, and the second is implied by the spec: don't accept channel_announcement if the channel isn't 6 deep. Since LND defers in such cases, we do too (unless it's newer than the current block, in which case we simply discard). Otherwise there's a risk that a slow node might discard valid gossip. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
Rusty Russell	6f9c5f2936	gossipd: get fed the blockheight from lightningd when we know it. This will let gossipd be more intelligent about gossiping before we're synced, and also it might know how far behind we are. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
Rusty Russell	27790832a5	gossipd: gossip_queries_ex is no longer experimental. The master spec has some typos which make it not parse, so I created a PR and generated the CSV from that: https://github.com/lightningnetwork/lightning-rfc/pull/673 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-22 01:17:11 +00:00
Rusty Russell	895e552475	BOLT: update to master with gossip_queries_ex. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-22 01:17:11 +00:00
Rusty Russell	6a8d18c7e3	gossipd: naming cleanups. Suggested-by: @cdecker. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	147eaced2e	developer: consolidate gossip timing options into one --dev-fast-gossip. It's generally clearer to have simple hardcoded numbers with an #if DEVELOPER around it, than apparent variables which aren't, really. Interestingly, our pruning test was always kinda broken: we have to pass two cycles, since l2 will refresh the channel once to avoid pruning. Do the more obvious thing, and cut the network in half and check that l1 and l3 time out. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	46e0f1efcc	gossipd: refresh every 13 days, not every 7. One day is plenty of time to propagate the update. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	21a6d502db	gossipd: move gossip message generation into its own file. gossipd.c is doing too many things: this is a start. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	0bab2580fc	gossipd: clean up local channel updates. Make update_local_channel use a timer if it's too soon to make another update. 1. Implement cupdate_different() which compares two updates. 2. make update_local_channel() take a single arg for timer usage. 3. Set timestamp of non-disable update back 5 minutes, so we can always generate a disable update if we need to. 4. Make update_local_channel() itself do the "unchanged update" suppression. 5. Keep pointer to the current timer so we override any old updates with a new one, to avoid a race. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	e1c431d278	gossipd: use local_chan_map more. We can look up local channels directly now, which offers simplifications. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	70c4ac6d74	gossipd: suppress our own too-close node_announcement messages. Never make them less than gossip_min_interval apart. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	178baeba6c	gossipd: get gossip_min_interval from lightningd. Default is 5 x gossip interval == 5 minutes. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	4cfd0524eb	gossipd: simplify duplicate node_announcement check. Write helpers to split it into non-timestamp, non-signature parts, and simply compare those. We extract a helper to do channel_update, too. This is more generic than our previous approach, and simpler. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	5ddd7866e4	gossipd: make create_node_announcement const-correct. sig is only non-const so we can override if NULL, but talz helps us here. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
trueptolemy	5361a5d059	JSON-API: `getroute` now also support `exclude` nodes	2019-09-16 12:22:06 +08:00
Rusty Russell	91072f56b0	developer: add 'dev-gossip-set-time' call to manipulate gossipd's time. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-12 05:11:56 +00:00
Rusty Russell	2577ad87d5	gossipd: use gossip_time_now() everywhere. We've been slack, but it's going to be important for testing ratelimiting. And it currently has a minor memory leak. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-12 05:11:56 +00:00
lisa neigut	fe6c7f8f80	gossip queries: patch up valgrind errors in tests These were giving me valgrind errors locally; fixed now.	2019-09-11 23:56:27 +00:00
darosior	0b0ad4c22d	transition from status_trace() to status_debug	2019-09-10 02:02:51 +00:00
darosior	ea6c95b2b3	gossipd: don't ignore wrong chain in 'query_channel_range' Give a NULL reply with the 'complete' flag to 0 instead	2019-09-10 02:02:51 +00:00
darosior	9be28fe40f	daemons tour: minor typos correction	2019-09-10 02:02:51 +00:00
Rusty Russell	aca2e4f722	common/memleak: add dynamic hooks for assisting memleak. Rather than reaching into data structures, let them register their own callbacks. This avoids us having to expose "memleak_remove_xxx" functions, and call them manually. Under the hood, this is done by having a specially-named tal child of the thing we want to assist, containing the callback. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-06 14:35:01 +02:00
Rusty Russell	28185c397c	gossipd: fix gossip send in case query_flags cause no output. Fortunately, again, only happens with EXPERIMENTAL_FEATURES. If the query causes us not to actually send anything, we won't get called again. This can validly happen if they only asked for the node_announcements, for example. (Found by protocol tests). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-27 12:35:25 +02:00
Rusty Russell	d1a1592cc8	gossipd: fix calculation of crc32 of update. Currently EXPERIMENTAL_FEATURES only, fortunately. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-27 12:35:25 +02:00
Rusty Russell	0ec8304901	gossipd: fix premature towire_reply_short_channel_ids_end if no node_announcement. Our "are we finished?" logic was wrong: it tested if there are no more node_announcements, but it's possible that there were no node_announcements for either end of the channel whose information we sent. This is actually quite unusual on the real network: looking at mainnet stats from last May, 4301 of 4337 nodes have node_announcements. However, with query flags it's much more likely, since they might not ask for node announcements at all. (Found by gossip protocol tests) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-26 23:09:00 +00:00
Rusty Russell	2f1e116510	gossipd: use htable_count() rather than reaching into htable struct. Now ccan/htable provides the helper, let's use it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-26 08:44:22 +00:00
Rusty Russell	4de47f6db5	gossipd: use default zlib compression, hack for zlib expansion. These both allow us to reproduce the test vectors in the next patch. But using Z_DEFAULT_COMPRESSION is a reasonable idea anyway. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
Rusty Russell	f6cf4bf62a	spec: remove encoding byte from checksums. Make the TLV element a simple array. This is a bit neater, in fact, and makes the test vectors in that 557 PR work. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
Rusty Russell	8abd850d3c	gossipd: append timestamps & checksums to reply_channel_range if asked (EXPERIMENTAL) In fact, we always generate them, we only send them if asked. And we set the flags to 0 if not --enable-experimental-features, so we never send in that case. Generating checksums involves pulling the channel_update from the gossip_store, which is suboptimal: there's a FIXME to store the checksum in memory. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
Rusty Russell	c7853197ae	gossipd: generalize encoding functions We're about to use them for gossip extended info too, which don't put the encoding byte at the beginning of the data stream. So this removes some "scids" from function names and separates out the "prepend a byte" case from the "external encoding_type" case. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
Rusty Russell	0de11da5e4	gossipd: decode and obey query_short_channel_ids's TLV query_flags (EXPERIMENTAL) These indicate what fields we are to return. If there's no TLV, or we haven't got --enable-experimental-features, it's set to all 1s so behaviour is unchanged. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
Rusty Russell	d2030539e1	EXPERIMENTAL: pull in PR 557 (with minor fixes): range query support. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-10 02:48:34 +00:00
lisa neigut	5c07afac7d	bolt: update to BOLT spec changes (extract format + type specifications) updates the bolt version to 6639cef095a2ecc7b8f0c48c6e7f2f906fbfbc58. this requires us to use the new bolt parser at generate-bolt.py and updates to all of the type specifications (ie. from u8 -> byte)	2019-07-16 06:10:58 +00:00
Rusty Russell	fc27250f80	gossipd: be more verbose and less assert()ive on bad node_announcement. We hit the timestamp assert on #2750; it shouldn't happen, but crashing doesn't leave much information. Reported-by: @m-schmook Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-21 22:03:35 +00:00
Rusty Russell	1e32b4ab29	gossipd: adjust gossip filters if we discover we're missing gossip. We pick up to three random peers and ask them to gossip more. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	6830233d0b	gossipd: control gossip level so we don't get flooded by peers. We seek a certain number of peers at each level of gossip; 3 "flood" if we're missing gossip, 2 at 24 hours past to catch recent gossip, and 8 with current gossip. The rest are given a filter which causes them not to gossip to us at all. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	f5ea57d4c0	gossipd: reset gossip_missing if no reports for 10 minutes. An arbitrary timeout. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	b9053767e7	gossipd: query unknown short_channel_ids, note if they were really missing. The first sign that we're missing gossip is that we get a channel_update for an unknown channel. The peer might be wrong (or lying), but if it turns out to be a real channel, we were definitely missing something. This patch does two things: queries when we get an unknown channel_update, and then notes that a channel_announcement was from such an update when it's finally processed. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	18069ab3da	gossipd: APIs return more information about routing message handling. In particular, we'll need to know the short_channel_id if a channel_update is unknown (implies we're missing a channel), and whether processing a pending channel_announcement was successful (implies that the channel was real). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	5ef7aa70d2	gossipd: prepare for internally-generated short-channel-id queries. Up until now we only generated these in dev mode for testing. Hoist into common code, turn counter into a flag (we're only allowed one!) and note if query is internal or not. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	21c920a8e8	gossipd: note if loaded store seems reasonably up-to-date. If not, we can ask peers for full gossip (for now we just set a flag). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	ab31f40aa2	gossipd: don't charge ourselves fees when calculating route. This means there's now a semantic difference between the default `fromid` and setting `fromid` explicitly to our own node_id. In the default case, it means we don't charge ourselves fees on the route. This means we can spend the full channel balance. We still want to consider the pricing of local channels, however: there's a reason to discount one over another, and that is to bias things. So we add the first-hop fee to the risk value instead. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-11 23:19:11 +00:00
Rusty Russell	b48c644e7a	listchannels: add `htlc_minimum_msat` and `htlc_maximum_msat` fields. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-11 23:19:11 +00:00
Rusty Russell	3e733afb2b	gossipd: remove broadcast map altogether. This clarifies things a fair bit: we simply add and remove from the gossip_store directly. Before this series: (--disable-developer, -Og) store_load_msec:20669-20902(20822.2+/-82) vsz_kb:439704-439712(439706+/-3.2) listnodes_sec:0.890000-1.000000(0.92+/-0.04) listchannels_sec:11.960000-13.380000(12.576+/-0.49) routing_sec:3.070000-5.970000(4.814+/-1.2) peer_write_all_sec:28.490000-30.580000(29.532+/-0.78) After: (--disable-developer, -Og) store_load_msec:19722-20124(19921.6+/-1.4e+02) vsz_kb:288320 listnodes_sec:0.860000-0.980000(0.912+/-0.056) listchannels_sec:10.790000-12.260000(11.65+/-0.5) routing_sec:2.540000-4.950000(4.262+/-0.88) peer_write_all_sec:17.570000-19.500000(18.048+/-0.73) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	728bb4e662	common/gossip_store: handle timestamp filtering. This means we intercept the peer's gossip_timestamp_filter request in the per-peer subdaemon itself. The rest of the semantics are fairly simple however. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	5591c0b5d8	gossipd: don't send gossip stream, let per-peer daemons read it themselves. Keeping the uintmap ordering all the broadcastable messages is expensive: 130MB for the million-channels project. But now we delete obsolete entries from the store, we can have the per-peer daemons simply read that sequentially and stream the gossip itself. This is the most primitive version, where all gossip is streamed; successive patches will bring back proper handling of timestamp filtering and initial_routing_sync. We add a gossip_state field to track what's happening with our gossip streaming: it's initialized in gossipd, and currently always set, but once we handle timestamps the per-peer daemon may do it when the first filter is sent. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	a5f6ef385a	gossipd: don't wrap messages when we send them to the peer. They already send us gossip messages, so they have to be distinct anyway. Why make us both do extra work? Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	df00f20e4a	gossipd: erase old entries from the store, don't just append. We use the high bit of the length field: this way we can still check that the checksums are valid on deleted fields. Once this is done, serially reading the gossip_store file will result in a complete, ordered, minimal gossip broadcast. Also, the horrible corner case where we might try to delete things from the store during load time is completely gone: we only load non-deleted things. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	43f2cbd250	gossipd: track gossip_store locations of local channels. We currently don't care, but the next patch means we have to find them again. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	d1f43d993a	gossipd: use explicit destructor for struct chan. Each destructor2 costs 40 bytes, and struct chan is only 120 bytes. So this drops our memory usage quite a bit: MCP bench results change: -vsz_kb:580004-580016(580006+/-4.8) +vsz_kb:533148 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-22 11:28:44 +00:00
Rusty Russell	59e75f1b2c	gossipd: reply to large listchannels in parts. This has two effects: most importantly, it avoids the problem where lightningd creates an 800MB JSON blob in response to listchannels, which causes OOM on the Raspberry Pi (our previous max allocation was 832MB). This is because lightning-cli can start draining the JSON while we're filling the buffer, so we end up with a max allocation of 68MB. But despite being less efficient (multiple queries to gossipd), it actually speeds things up due to the parallelism: MCP with -O3 -flto before vs after: -listchannels_sec:8.980000-9.330000(9.206+/-0.14) +listchannels_sec:7.500000-7.830000(7.656+/-0.11) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-22 11:28:44 +00:00
Rusty Russell	cb9c44ef27	gossipd: remove unnecessary dev_unknown_channel_satoshis arg. We now have a test blockchain for MCP which has the correct channels, so this is not needed. Also fix a benchmark script bug where 'mv "$DIR"/log "$DIR"/log.old.$$' would fail if your log didn't exist from a previous run. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-22 11:28:44 +00:00
darosior	d9db9dc1ae	gossipd: fix listnodes crash on non existing id 'node_arr' was not instantiated if an id was passed to listnodes and we could not get a node from it	2019-05-16 19:30:10 +02:00
Rusty Russell	f5a218f9d1	gossipd: send per-peer daemons offsets into gossip store. Instead of reading the store ourselves, we can just send them an offset. This saves gossipd a lot of work, putting it where it belongs (in the daemon responsible for the specific peer). MCP bench results: store_load_msec:28509-31001(29206.6+/-9.4e+02) vsz_kb:580004-580016(580006+/-4.8) store_rewrite_sec:11.640000-12.730000(11.908+/-0.41) listnodes_sec:1.790000-1.880000(1.83+/-0.032) listchannels_sec:21.180000-21.950000(21.476+/-0.27) routing_sec:2.210000-11.160000(7.126+/-3.1) peer_write_all_sec:36.270000-41.200000(38.168+/-1.9) Significant savings in streaming gossip: -peer_write_all_sec:48.160000-51.480000(49.608+/-1.1) +peer_write_all_sec:35.780000-37.980000(36.43+/-0.81) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-13 05:16:18 +00:00
Rusty Russell	d8db4e871f	gossipd: provide new fd to per-peer daemons when we compact it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-13 05:16:18 +00:00
Rusty Russell	13717c6ebb	gossipd: hand a gossip_store_fd to all subdaemons. This will let them read from the gossip store directly. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-13 05:16:18 +00:00
Rusty Russell	89291b930e	gossipd: pass amount into gossip_store, rather than having it fetch. We need to store the channel capacity for channel_announcement: hand it in directly rather than having the gossip_store code do a lookup. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-13 05:16:18 +00:00
Rusty Russell	c7034f271a	gossipd: avoid tal overhead in listnodes We know exactly how many there will be, so allocate an entire array up-front. -listnodes_sec:2.540000-2.610000(2.584+/-0.029) +listnodes_sec:2.100000-2.170000(2.118+/-0.026) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-05-13 05:16:18 +00:00
Rusty Russell	261921dee2	gossipd: adjust peers' broadcast_offset when compacting store. When we compact the store, we need to adjust the broadcast index for peers so they know where they're up to. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	fdb42c3170	gossipd: don't keep channel_updates in memory. This requires some trickiness when we want to re-add unannounced channels to the store after compaction, so we extract a common "copy_message" to transfer from old store to new. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:36034-37853(37109.8+/-5.9e+02) vsz_kb:577456 store_rewrite_sec:12.490000-13.250000(12.862+/-0.27) listnodes_sec:1.250000-1.480000(1.364+/-0.09) listchannels_sec:30.820000-31.480000(31.068+/-0.24) routing_sec:26.940000-27.990000(27.616+/-0.39) peer_write_all_sec:65.690000-68.600000(66.698+/-0.99) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1202316 +vsz_kb:577456 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	aeb72a05e3	gossipd: remove some fields from struct chan. The txout_script field is unused; the local_disable only applies to the handful of local channels, so move that into a hash table. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:39207-45089(41374.6+/-2.2e+03) vsz_kb:1202316 store_rewrite_sec:15.090000-16.790000(15.654+/-0.63) listnodes_sec:1.290000-3.790000(1.938+/-0.93) listchannels_sec:30.190000-32.120000(31.31+/-0.69) routing_sec:28.220000-31.340000(29.314+/-1.2) peer_write_all_sec:66.830000-76.850000(71.976+/-3.6) MCP notable changes from previous patch (>1 stddev): -store_load_msec:35107-37944(36686+/-1e+03) +store_load_msec:39207-45089(41374.6+/-2.2e+03) -vsz_kb:1218036 +vsz_kb:1202316 -listchannels_sec:28.510000-30.270000(29.6+/-0.6) +listchannels_sec:30.190000-32.120000(31.31+/-0.69) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	3280466e19	gossipd: don't keep channel_announcement messages in memory. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35107-37944(36686+/-1e+03) vsz_kb:1218036 store_rewrite_sec:14.060000-17.970000(15.966+/-1.6) listnodes_sec:1.270000-1.350000(1.314+/-0.034) listchannels_sec:28.510000-30.270000(29.6+/-0.6) routing_sec:30.230000-31.510000(30.83+/-0.44) peer_write_all_sec:67.390000-70.710000(68.568+/-1.2) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1780516 +vsz_kb:1218036 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	2fd4a0121f	gossipd: unify is_chan_public / is_chan_announced. We used to have a `struct chan` while we're waiting for an update; now we keep that internally. So a `struct chan` without a channel_announcement in the store is private, and other is public. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	aafc489edb	gossipd: remove info fields from struct node. Reload them from disk if they do listnodes. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35390-38659(37336.4+/-1.3e+03) vsz_kb:1780516 store_rewrite_sec:13.800000-16.800000(15.02+/-0.98) listnodes_sec:1.280000-1.530000(1.382+/-0.096) listchannels_sec:28.700000-30.440000(29.34+/-0.68) routing_sec:30.120000-31.080000(30.526+/-0.35) peer_write_all_sec:65.910000-76.850000(69.462+/-4.1) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1792996 +vsz_kb:1780516 -listnodes_sec:1.030000-1.120000(1.068+/-0.032) +listnodes_sec:1.280000-1.530000(1.382+/-0.096) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	0608c36301	gossipd: don't keep node_announcement messages in memory. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34779-38628(36903.4+/-1.4e+03) vsz_kb:1792996 store_rewrite_sec:14.440000-15.040000(14.672+/-0.24) listnodes_sec:1.030000-1.120000(1.068+/-0.032) listchannels_sec:27.860000-32.850000(30.05+/-1.7) routing_sec:30.020000-31.700000(31.044+/-0.56) peer_write_all_sec:65.100000-70.600000(68.422+/-2) -vsz_kb:1780516 +vsz_kb:1792996 -listnodes_sec:1.280000-1.530000(1.382+/-0.096) +listnodes_sec:1.030000-1.120000(1.068+/-0.032) MCP notable changes from previous patch (>1 stddev): -store_load_msec:30640-33236(32202+/-8.7e+02) +store_load_msec:34779-38628(36903.4+/-1.4e+03) -vsz_kb:1812956 +vsz_kb:1792996 -listnodes_sec:0.590000-0.660000(0.62+/-0.033) +listnodes_sec:1.030000-1.120000(1.068+/-0.032) -peer_write_all_sec:60.380000-61.320000(60.836+/-0.37) +peer_write_all_sec:65.100000-70.600000(68.422+/-2) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	3ef767fd52	gossipd: don't use cached node_announcement for redundancy checking Re-parse the existing message, since we're going to get rid of those fields. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	6b9069ee28	broadcast: don't keep payload pointer. If we need the payload, pull it from the gossip store. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:30189-52561(39416.4+/-8.8e+03) vsz_kb:1812904 store_rewrite_sec:21.390000-27.070000(23.596+/-2.4) listnodes_sec:1.120000-1.230000(1.176+/-0.044) listchannels_sec:38.900000-50.580000(44.716+/-3.9) routing_sec:45.080000-48.160000(46.814+/-1.1) peer_write_all_sec:58.780000-87.150000(72.278+/-9.7) MCP notable changes from previous patch (>1 stddev): -vsz_kb:2288784 +vsz_kb:1812904 -store_rewrite_sec:38.060000-39.130000(38.426+/-0.39) +store_rewrite_sec:21.390000-27.070000(23.596+/-2.4) -listnodes_sec:0.750000-0.850000(0.794+/-0.042) +listnodes_sec:1.120000-1.230000(1.176+/-0.044) -listchannels_sec:30.740000-31.760000(31.096+/-0.35) +listchannels_sec:38.900000-50.580000(44.716+/-3.9) -routing_sec:29.600000-33.560000(30.472+/-1.5) +routing_sec:45.080000-48.160000(46.814+/-1.1) -peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) +peer_write_all_sec:58.780000-87.150000(72.278+/-9.7) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	1f08cfb3e3	gossipd: use file offset within store as broadcast index. Instead of an arbitrary counter, we can use the file offset for our partial ordering, removing a field. It takes some care when we compact the store, however, as this field changes. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34271-35283(34789.6+/-3.3e+02) vsz_kb:2288784 store_rewrite_sec:38.060000-39.130000(38.426+/-0.39) listnodes_sec:0.750000-0.850000(0.794+/-0.042) listchannels_sec:30.740000-31.760000(31.096+/-0.35) routing_sec:29.600000-33.560000(30.472+/-1.5) peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) MCP notable changes from previous patch (>1 stddev): -store_load_msec:35685-38538(37090.4+/-9.1e+02) +store_load_msec:34271-35283(34789.6+/-3.3e+02) -vsz_kb:2288768 +vsz_kb:2288784 -peer_write_all_sec:51.140000-58.350000(55.69+/-2.4) +peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	eb4564c3cd	gossipd: embed broadcast information into each structure. This is more compact, but also required once we replace the arbitrary "index" with an actual offset into the gossip store. That will let us remove the in-memory variants entirely. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35685-38538(37090.4+/-9.1e+02) vsz_kb:2288768 store_rewrite_sec:35.530000-41.230000(37.904+/-2.3) listnodes_sec:0.720000-0.810000(0.762+/-0.041) listchannels_sec:30.750000-35.990000(32.704+/-2) routing_sec:29.570000-34.010000(31.374+/-1.8) peer_write_all_sec:51.140000-58.350000(55.69+/-2.4) MCP notable changes from previous patch (>1 stddev): -vsz_kb:2621808 +vsz_kb:2288768 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	617c23e735	gossipd: use u32 for timestamp. We used an s64 so we could use -1 and save a check, but that's just silly as we have adjacent non-u64 fields: wastes 7 bytes per node and 16 per channel. Interestingly, this seemed to make us a little slower for some reason. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35569-38776(37169.8+/-1.2e+03) vsz_kb:2621808 store_rewrite_sec:35.870000-40.290000(38.14+/-1.6) listnodes_sec:0.740000-0.800000(0.768+/-0.023) listchannels_sec:29.820000-32.730000(30.972+/-0.99) routing_sec:30.110000-30.590000(30.346+/-0.18) peer_write_all_sec:52.420000-59.160000(54.692+/-2.5) MCP notable changes from previous patch (>1 stddev): -store_load_msec:32825-36365(34615.6+/-1.1e+03) +store_load_msec:35569-38776(37169.8+/-1.2e+03) -vsz_kb:2637488 +vsz_kb:2621808 -store_rewrite_sec:35.150000-36.200000(35.59+/-0.4) +store_rewrite_sec:35.870000-40.290000(38.14+/-1.6) -listnodes_sec:0.590000-0.710000(0.682+/-0.046) +listnodes_sec:0.740000-0.800000(0.768+/-0.023) -peer_write_all_sec:49.020000-52.890000(50.376+/-1.5) +peer_write_all_sec:52.420000-59.160000(54.692+/-2.5) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	0b484b111e	gossipd: make more compact getchannels entries. We can save significant space by combining both sides: so much that we can reduce the WIRE_LEN_LIMIT to something sane again. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34467-36764(35517.8+/-7.7e+02) vsz_kb:2637488 store_rewrite_sec:35.310000-36.580000(35.816+/-0.44) listnodes_sec:1.140000-2.780000(1.596+/-0.6) listchannels_sec:55.390000-58.110000(56.998+/-0.99) routing_sec:30.330000-30.920000(30.642+/-0.19) peer_write_all_sec:50.640000-53.360000(51.822+/-0.91) MCP notable changes from previous patch (>1 stddev): -store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) +store_rewrite_sec:35.310000-36.580000(35.816+/-0.44) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	91849dddc4	wire: use struct node_id for node ids. Don't turn them to/from pubkeys implicitly. This means nodeids in the store don't get converted, but bitcoin keys still do. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:33934-35251(34531.4+/-5e+02) vsz_kb:2637488 store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) listnodes_sec:1.020000-1.290000(1.146+/-0.086) listchannels_sec:51.110000-58.240000(54.826+/-2.5) routing_sec:30.000000-33.320000(30.726+/-1.3) peer_write_all_sec:50.370000-52.970000(51.646+/-1.1) MCP notable changes from previous patch (>1 stddev): -store_load_msec:46184-47474(46673.4+/-4.5e+02) +store_load_msec:33934-35251(34531.4+/-5e+02) -vsz_kb:2638880 +vsz_kb:2637488 -store_rewrite_sec:46.750000-48.280000(47.512+/-0.51) +store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	a2fa699e0e	Use node_id everywhere for nodes. I tried to just do gossipd, but it was uncontainable, so this ended up being a complete sweep. We didn't get much space saving in gossipd, even though we should save 24 bytes per node. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	417e1bab7d	gossipd: use iterator helpers for iterating node channels. Makes the next step easier. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:45791-46917(46330.4+/-3.6e+02) vsz_kb:2641316 store_rewrite_sec:47.040000-48.720000(47.684+/-0.57) listnodes_sec:1.140000-1.340000(1.2+/-0.072) listchannels_sec:50.970000-54.250000(52.698+/-1.3) routing_sec:29.950000-31.010000(30.332+/-0.37) peer_write_all_sec:51.570000-52.970000(52.1+/-0.54) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	7c8f506a0f	dev-compact-store-gossip: specific RPC so we can test gossip_store rewrite. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	5b12007a4f	gossipd: dev option to allow unknown channels. This lets us benchmark without a valid blockchain. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Header from folded patch 'fixup!_gossipd__dev_option_to_allow_unknown_channels.patch': fixup! gossipd: dev option to allow unknown channels. Suggested-by: @cdecker Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	f8f6533dba	dev: --dev-gossip-time so gossipd doesn't prune old data. This is useful for canned data, such as the million channels project. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	b2c93beaed	gossipd: use htable instead of simple array for node's channels. For giant nodes, it seems we spend a lot of time memmoving this array. Normally we'd go for a linked list, but that's actually hard: each channel has two nodes, so needs two embedded list pointers, and when iterating there's no good way to figure out which embedded pointer we'd be using. So we (ab)use htable; we don't really need an index, but it's good for cache-friendly iteration (our main operation). We can actually change to a hybrid later to avoid the extra allocation for small nodes. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	28f5da7b2f	tools/generate-wire: use amount_msat / amount_sat for peer protocol. Basically we tell it that every field ending in '_msat' is a struct amount_msat, and 'satoshis' is an amount_sat. The exceptions are channel_update's fee_base_msat which is a u32, and final_incorrect_htlc_amount's incoming_htlc_amt which is also a 'struct amount_msat'. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	3ac0e814d0	daemons: use amount_msat/amount_sat in all internal wire transfers. As a side-effect of using amount_msat in gossipd/routing.c, we explicitly handle overflows and don't need to pre-prune ridiculous-fee channels. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	662bb0c565	gossipd: fix riskfactor passing. We used a u16, and a 1000 multiplier, which meant we wrapped at riskfactor 66. We also never undid the multiplier, so we ended up applying 1000x the riskfactor they specified. This changes us to pass the riskfactor with a 1M multiplier. The next patch changes the definition of riskfactor to be more useful. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	6a26b0c18d	gossipd: increase randomness in route selection. We have a seed, which is for (future!) unit testing consistency. This makes it change every time, so our pay_direct_test is more useful. I tried restarting the node around the loop, but it tended to fail rebinding to the same port for some reason? Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	afab1f7b3c	gossipd: handle onion errors internally. As a general rule, lightningd shouldn't parse user packets. We move the parsing into gossipd, and have it respond only to permanent failures. Note that we should not unconditionally remove a channel on WIRE_INVALID_ONION_HMAC, as this can be triggered (and we do!) by feeding sendpay a route with an incorrect pubkey. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-23 22:08:08 +01:00
Rusty Russell	4eddf57fd9	gossipd: don't mark channels unroutable. For transient failures, the pay plugin should simply exclude those from route considerations. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-23 22:08:08 +01:00
Rusty Russell	018a3f1d58	short_channel_id: make mk_short_channel_id return a failure. We had a bug `0ba547ee10` caused by short_channel_id overflow. If we'd caught this, we'd have terminated the peer instead of crashing, so add appropriate checks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-21 12:31:06 +01:00
Rusty Russell	0ba547ee10	gossipd: handle overflowing query properly (avoid slow 100% CPU reports) Don't do this: (gdb) bt #0 0x00007f37ae667c40 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #1 0x00007f37ae668b38 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #2 0x00007f37ae669907 in deflate () from /lib/x86_64-linux-gnu/libz.so.1 #3 0x00007f37ae674c65 in compress2 () from /lib/x86_64-linux-gnu/libz.so.1 #4 0x000000000040cfe3 in zencode_scids (ctx=0xc1f118, scids=0x2599bc49 "\a\325{", len=176320) at gossipd/gossipd.c:218 #5 0x000000000040d0b3 in encode_short_channel_ids_end (encoded=0x7fff8f98d9f0, max_bytes=65490) at gossipd/gossipd.c:236 #6 0x000000000040dd28 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=8) at gossipd/gossipd.c:576 #7 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=16) at gossipd/gossipd.c:595 #8 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=32) at gossipd/gossipd.c:596 #9 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=64) at gossipd/gossipd.c:595 #10 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=128) at gossipd/gossipd.c:596 #11 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=256) at gossipd/gossipd.c:595 #12 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=512) at gossipd/gossipd.c:595 #13 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=1024) at gossipd/gossipd.c:595 #14 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2047) at gossipd/gossipd.c:596 #15 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4095) at gossipd/gossipd.c:595 
#16 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8191) at gossipd/gossipd.c:595 #17 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16382) at gossipd/gossipd.c:595 #18 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=32764) at gossipd/gossipd.c:595 #19 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=65528) at gossipd/gossipd.c:595 #20 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=131056) at gossipd/gossipd.c:595 #21 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=262112) at gossipd/gossipd.c:595 #22 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=524225) at gossipd/gossipd.c:595 #23 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=1048450) at gossipd/gossipd.c:595 #24 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2096900) at gossipd/gossipd.c:595 #25 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4193801) at gossipd/gossipd.c:595 #26 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8387603) at gossipd/gossipd.c:595 #27 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16775207) at gossipd/gossipd.c:595 #28 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=33550414) at gossipd/gossipd.c:596 #29 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=67100829) at gossipd/gossipd.c:595 #30 0x000000000040ddc6 in queue_channel_ranges 
(peer=0x3868fc8, first_blocknum=514201, number_of_blocks=134201659) at gossipd/gossipd.c:595 #31 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=268403318) at gossipd/gossipd.c:595 #32 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=536806636) at gossipd/gossipd.c:595 #33 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=1073613273) at gossipd/gossipd.c:595 #34 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=2147226547) at gossipd/gossipd.c:595 #35 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=4294453094) at gossipd/gossipd.c:595 #36 0x000000000040df26 in handle_query_channel_range (peer=0x3868fc8, msg=0x37e0678 "\001\ao\342\214\n\266\361\263r\301\246\242F\256c\367O\223\036\203e\341Z\b\234h\326\031") at gossipd/gossipd.c:625 The cause was that converting a block number to an scid truncates it at 24 bits. When we look through the index from (truncated number) to (real end number) we get every channel, which is too large to encode, so we iterate again. This fixes both that problem, and also the issue that we'd end up dividing into many empty sections until we get to the highest block number. Instead, we just tack the empty blocks on to the end of the final query. (My initial version requested 0xFFFFFFFE blocks, but the dev code which records what blocks were returned can't make a bitmap that big on 32 bit). Reported-by: George Vaccaro Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 11:34:45 -08:00
Rusty Russell	9f1f79587e	short_channel_id_dir: new primitive for one direction of short_channel_id Currently only used by gossipd for channel elimination. Also print them in canonical form (/[01]), so tests need to be changed. Suggested-by: @cdecker Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	dc2ee9639b	listchannels: allow source arg to list channels by their source node. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	358b7fda91	getroute: allow caller to specify maximum hops. This is required for routeboost. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	599ec5efbe	gossipd: allow an array of excluded channels for getroute_request. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	1567238dd9	invoice: option to expose/not-expose private channels. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	fe4a600bc7	routeboost: don't use channels to dead-end nodes. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	547d6ab878	routeboost: expose private channel in invoice iff we have no public ones. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	f321b1d35f	getroute: remove seed arg, document fromid, make default fuzzpercent match docs. seed isn't very useful at this level: I've left it in routing.c because it might be useful for detailed testing. Pretty sure it's unused, so I simply removed it. The fuzzpercent is documented to default at 5%, but actually was 75%. Fix that too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	26dda57cc0	utils: make tal_arr_expand safer. Christian and I both unwittingly used it in form: tal_arr_expand(&x) = tal(x, ...) Since '=' isn't a sequence point, the compiler can (and does!) cache the value of x, handing it to tal *after* tal_arr_expand() moves it due to tal_resize(). The new version is somewhat less convenient to use, but doesn't have this problem, since the assignment is always evaluated after the resize. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Christian Decker	65054ae72e	bolt: Updated the BOLT specification to a07dc3df3b4611989e3359f28f96c574f7822850 This is mainly just copying over the copy-editing from the lightning-rfc repository. [ Split to just perform changes prior to the UNKNOWN_PAYMENT_HASH change --RR ] Signed-off-by: Christian Decker <decker.christian@gmail.com> Reported-by: Rusty Russell <@rustyrussell>	2019-01-15 02:19:56 +00:00
Rusty Russell	ab735dcbe6	gossipd: wire up memleak detection. For simplicity we dump leaks to logs, and just return a bool to master. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-22 05:15:42 +00:00
Rusty Russell	78771ca371	gossipd: mark timers as not being leaks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-22 05:15:42 +00:00
Rusty Russell	9620393109	gossipd: store chainparams internally. We keep a chain_hash in struct daemon, because otherwise we end up with `&peer->daemon->rstate->chainparams->genesis_blockhash` which is a bit ridiculous. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 21:43:37 +00:00
Rusty Russell	5312ec1e34	gossipd: add documentation comments now it's relatively understandable. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	ea2c03e2e2	gossipd: don't have code to exit final loop; we always leave via master_gone. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	4038061d0f	gossipd: use take() in getroute_req. Trivial optimization. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	5c60d7ffb2	gossipd: split wire types into msgs from lightningd and msgs from per-peer daemons This avoids some very ugly switch() statements which mixed the two, but we also take the chance to rename 'towire_gossip_' to 'towire_gossipd_' for those inter-daemon messages; they're messages to gossipd, not gossip messages. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	07b16e37d0	daemon_conn: don't rely on outq_empty callback telling us to retry queue. We had at least one bug caused by it not returning true when it had queued something. Instead, just re-check the queue after it's called. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	4e9eba1965	gossipd: rework query_channel_range to accept overlapping range. We shouldn't insist on an exact response match: they can batch it and send a whole batch, as long as it overlaps what we ask. We also change to a bitmap to save some memory. This isn't noted in the CHANGELOG since we don't actually send gossip range queries except for testing. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	363564301f	gossipd: be more rigorous in handling peer messages vs. daemon requests. Messages from a peer may be invalid in many ways: we send an error packet in that case. Rather than internally calling peer_error, however, we make it explicit by having the handle_ functions return NULL or an error packet. Messages from the daemon itself should not be invalid: we log an error and close the fd to them if it is. Previously we logged an error but didn't kill them. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	1bd76861fd	gossipd: reorder functions into related groups (MOVEONLY) It's MOVEONLY but for the removal of the '#ifndef TESTING' which was needed for old test code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	3c97f3954e	daemon_conn: make it a tal object, typesafe callbacks. It means an extra allocation at startup, but it means we can hide the definition, and use standard patterns (new_daemon_conn and typesafe callbacks). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	0e6aec081a	gossipd: make sure that freeing peer closes connection to it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	689d51cba5	common/daemon_conn: remove finished function. For the moment, caller sets it manually. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	66dcba099d	gossipd: hand raw pubkeys in getnodes and getchannels entries. We spend quite a bit of time in libsecp256k1 moving them to and from DER encoding. With a bit of care, we can transfer the raw bytes from gossipd and manually decode them so a malformed one can't make us abort(). Before: real 0m0.629000-0.695000(0.64985+/-0.019)s After: real 0m0.359000-0.433000(0.37645+/-0.023)s At this point, the main issues are 11% of time spent in ccan/io's backend_wake (I tried using a hash table there, but that actually makes the small-number-of-fds case slower), and 65% of gossipd's time is in marshalling the response (all those tal_resize add up!). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-19 22:02:11 +00:00
Rusty Russell	bbc36a7bec	gossipd: update node announcement even if we change within a second. Usually Travis triggers corner cases because it's so slow, but this time the moons aligned, and it managed to fail test_node_reannounce because it generated the updated node_announcement with the same timestamp as the old one. This is because we only updated "last_announce_timestamp" when we generated the announcement, not when we got it off the wire or loaded it from the gossip store. The fix is to ask the routing code what the latest timestamp is; we could still generate a clashing timestamp if (1) the gossip store is lost, and (2) we restart within one second. Hard to care. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-16 04:24:03 +00:00
lisa neigut	0ae1d03513	BOLT7: broadcast `htlc_maximum_msat` in `channel_update`s Have c-lightning nodes send out the largest value for `htlc_maximum_msat` that makes sense, ie the lesser of the peer's max_inflight_htlc value or the total channel capacity minus the total channel reserve.	2018-10-16 03:32:27 +00:00
Rusty Russell	afac01380d	gossipd: don't initialize broadcast interval, make field name explicit. We initialize it to 30 seconds, but it's always overridden by the gossip_init message (and usually to 60 seconds, so it's doubly misleading). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00
Rusty Russell	3991425111	gossipd: don't accept forwarding short_channel_ids we don't own. Gossipd provided a generic "get endpoints of this scid" and we only use it in one place: to look up htlc forwards. But lightningd just assumed that one would be us. Instead, provide a simpler API which only returns the peer node if any, and now we handle it much more gracefully. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00
Rusty Russell	030fe1ce53	gossipd: don't expose private channels for routeboost. We don't create unannounceable channels, but other implementations can. Not only is it rude to expose these via invoices, it's probably not useable anyway. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00

269 Commits