rgb-cln

Commit Graph

Author	SHA1	Message	Date
trueptolemy	ee036a2e36	Gossipd: change the pending_cannouncement list to htable	2019-04-14 05:39:31 +00:00
Rusty Russell	261921dee2	gossipd: adjust peers' broadcast_offset when compacting store. When we compact the store, we need to adjust the broadcast index for peers so they know where they're up to. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	fdb42c3170	gossipd: don't keep channel_updates in memory. This requires some trickiness when we want to re-add unannounced channels to the store after compaction, so we extract a common "copy_message" to transfer from old store to new. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:36034-37853(37109.8+/-5.9e+02) vsz_kb:577456 store_rewrite_sec:12.490000-13.250000(12.862+/-0.27) listnodes_sec:1.250000-1.480000(1.364+/-0.09) listchannels_sec:30.820000-31.480000(31.068+/-0.24) routing_sec:26.940000-27.990000(27.616+/-0.39) peer_write_all_sec:65.690000-68.600000(66.698+/-0.99) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1202316 +vsz_kb:577456 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	0370ed2eca	gossipd: use pread in the store. The next patch causes us to access the store while loading (we read channel_updates for local peers), which messes up loading due to the lseek involved. Using pread() is atomic with seek & read, and also a bit more efficient. Make the header contiguous too, while we're here. We don't need pwrite: we always open with O_APPEND which means the seek-to-end is implicit. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:36771-38289(37529.6+/-5.3e+02) vsz_kb:1202316 store_rewrite_sec:12.460000-13.280000(12.784+/-0.29) listnodes_sec:1.240000-1.410000(1.34+/-0.058) listchannels_sec:29.850000-31.840000(30.908+/-0.69) routing_sec:27.800000-31.790000(28.822+/-1.5) peer_write_all_sec:66.200000-68.720000(67.44+/-0.84) MCP notable changes from previous patch (>1 stddev): -store_load_msec:39207-45089(41374.6+/-2.2e+03) +store_load_msec:36771-38289(37529.6+/-5.3e+02) -store_rewrite_sec:15.090000-16.790000(15.654+/-0.63) +store_rewrite_sec:12.460000-13.280000(12.784+/-0.29) -peer_write_all_sec:66.830000-76.850000(71.976+/-3.6) +peer_write_all_sec:66.200000-68.720000(67.44+/-0.84) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	2135c7a024	gossipd: allow reading from the store during load. When we no longer keep channel_updates in memory, there's a path where we access them on load: when we promote a local channel to an announced channel. This breaks at the moment, since gs->fd == -1; change it to a writable flag instead. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	aeb72a05e3	gossipd: remove some fields from struct chan. The txout_script field is unused; the local_disable only applies to the handful of local channels, so move that into a hash table. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:39207-45089(41374.6+/-2.2e+03) vsz_kb:1202316 store_rewrite_sec:15.090000-16.790000(15.654+/-0.63) listnodes_sec:1.290000-3.790000(1.938+/-0.93) listchannels_sec:30.190000-32.120000(31.31+/-0.69) routing_sec:28.220000-31.340000(29.314+/-1.2) peer_write_all_sec:66.830000-76.850000(71.976+/-3.6) MCP notable changes from previous patch (>1 stddev): -store_load_msec:35107-37944(36686+/-1e+03) +store_load_msec:39207-45089(41374.6+/-2.2e+03) -vsz_kb:1218036 +vsz_kb:1202316 -listchannels_sec:28.510000-30.270000(29.6+/-0.6) +listchannels_sec:30.190000-32.120000(31.31+/-0.69) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	3280466e19	gossipd: don't keep channel_announcement messages in memory. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35107-37944(36686+/-1e+03) vsz_kb:1218036 store_rewrite_sec:14.060000-17.970000(15.966+/-1.6) listnodes_sec:1.270000-1.350000(1.314+/-0.034) listchannels_sec:28.510000-30.270000(29.6+/-0.6) routing_sec:30.230000-31.510000(30.83+/-0.44) peer_write_all_sec:67.390000-70.710000(68.568+/-1.2) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1780516 +vsz_kb:1218036 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	2fd4a0121f	gossipd: unify is_chan_public / is_chan_announced. We used to have a `struct chan` while we're waiting for an update; now we keep that internally. So a `struct chan` without a channel_announcement in the store is private, and other is public. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	aafc489edb	gossipd: remove info fields from struct node. Reload them from disk if they do listnodes. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35390-38659(37336.4+/-1.3e+03) vsz_kb:1780516 store_rewrite_sec:13.800000-16.800000(15.02+/-0.98) listnodes_sec:1.280000-1.530000(1.382+/-0.096) listchannels_sec:28.700000-30.440000(29.34+/-0.68) routing_sec:30.120000-31.080000(30.526+/-0.35) peer_write_all_sec:65.910000-76.850000(69.462+/-4.1) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1792996 +vsz_kb:1780516 -listnodes_sec:1.030000-1.120000(1.068+/-0.032) +listnodes_sec:1.280000-1.530000(1.382+/-0.096) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	0608c36301	gossipd: don't keep node_announcement messages in memory. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34779-38628(36903.4+/-1.4e+03) vsz_kb:1792996 store_rewrite_sec:14.440000-15.040000(14.672+/-0.24) listnodes_sec:1.030000-1.120000(1.068+/-0.032) listchannels_sec:27.860000-32.850000(30.05+/-1.7) routing_sec:30.020000-31.700000(31.044+/-0.56) peer_write_all_sec:65.100000-70.600000(68.422+/-2) -vsz_kb:1780516 +vsz_kb:1792996 -listnodes_sec:1.280000-1.530000(1.382+/-0.096) +listnodes_sec:1.030000-1.120000(1.068+/-0.032) MCP notable changes from previous patch (>1 stddev): -store_load_msec:30640-33236(32202+/-8.7e+02) +store_load_msec:34779-38628(36903.4+/-1.4e+03) -vsz_kb:1812956 +vsz_kb:1792996 -listnodes_sec:0.590000-0.660000(0.62+/-0.033) +listnodes_sec:1.030000-1.120000(1.068+/-0.032) -peer_write_all_sec:60.380000-61.320000(60.836+/-0.37) +peer_write_all_sec:65.100000-70.600000(68.422+/-2) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	cb297b0a1b	gossipd: free tmpctx children in gossip_store_load loop. We're accumulating children, and we'll get more in the successive patches. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	3ef767fd52	gossipd: don't use cached node_announcement for redundancy checking Re-parse the existing message, since we're going to get rid of those fields. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	e02f5817fe	gossipd: don't create struct chan for yet-to-be-updated channels. We currently create a struct chan when we receive a `channel_announcement`, but we can only broadcast once we have a `channel_update` (since that provides the timestamp). This means a `struct chan` can be in a weird state where it exists, but is unusable (can't use without an update), and also means we need to keep the channel_announcement message around until an update arrives, so we can put it in the gossip_store. Instead, keep track of these "unupdated" channels separately, and check for them in all the places we search for a specific channel to update. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:30640-33236(32202+/-8.7e+02) vsz_kb:1812956 store_rewrite_sec:13.410000-16.970000(14.438+/-1.3) listnodes_sec:0.590000-0.660000(0.62+/-0.033) listchannels_sec:28.140000-29.560000(28.816+/-0.56) routing_sec:29.530000-32.590000(30.352+/-1.1) peer_write_all_sec:60.380000-61.320000(60.836+/-0.37) MCP notable changes from previous patch (>1 stddev): -vsz_kb:1812904 +vsz_kb:1812956 -store_rewrite_sec:21.390000-27.070000(23.596+/-2.4) +store_rewrite_sec:13.410000-16.970000(14.438+/-1.3) -listnodes_sec:1.120000-1.230000(1.176+/-0.044) +listnodes_sec:0.590000-0.660000(0.62+/-0.033) -listchannels_sec:38.900000-50.580000(44.716+/-3.9) +listchannels_sec:28.140000-29.560000(28.816+/-0.56) -routing_sec:45.080000-48.160000(46.814+/-1.1) +routing_sec:29.530000-32.590000(30.352+/-1.1) -peer_write_all_sec:58.780000-87.150000(72.278+/-9.7) +peer_write_all_sec:60.380000-61.320000(60.836+/-0.37) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	d8aee68ba8	gossipd: handle duplicate nodes from unverified channel_announces properly. If we have a channel_announcement, we catch any node_announcement for either end while we validate the channel_announcement. But if we have multiple channel_announcements and the first one failed to verify, it would remove this catch, meaning we'd discard following node_announcements even though there was a pending channel_announcement. The answer is to use a simple reference count, and as a further optimization, only place the `pending_node_announce` if there's no node already. We also move the process_pending_node_announcement() calls lower down, so any new channel creation checks it. This is more robust, and will prove useful for the next patch, where we can use the same mechanism to handle node_announcements on channel_announcements which are verified, but don't yet have a channel_update. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	da884751e8	gossipd: make routing_add_channel_update discard old timestamps. This is currently done higher up, in handle_channel_update(), but that's one reason why handle_channel_update() has to do a channel lookup. Moving the check down means handle_channel_update() can do a minimal "get node id for this channel" so it can check the signature. This helps, because the chan lookup semantics are changing in the next few patches. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	6b9069ee28	broadcast: don't keep payload pointer. If we need the payload, pull it from the gossip store. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:30189-52561(39416.4+/-8.8e+03) vsz_kb:1812904 store_rewrite_sec:21.390000-27.070000(23.596+/-2.4) listnodes_sec:1.120000-1.230000(1.176+/-0.044) listchannels_sec:38.900000-50.580000(44.716+/-3.9) routing_sec:45.080000-48.160000(46.814+/-1.1) peer_write_all_sec:58.780000-87.150000(72.278+/-9.7) MCP notable changes from previous patch (>1 stddev): -vsz_kb:2288784 +vsz_kb:1812904 -store_rewrite_sec:38.060000-39.130000(38.426+/-0.39) +store_rewrite_sec:21.390000-27.070000(23.596+/-2.4) -listnodes_sec:0.750000-0.850000(0.794+/-0.042) +listnodes_sec:1.120000-1.230000(1.176+/-0.044) -listchannels_sec:30.740000-31.760000(31.096+/-0.35) +listchannels_sec:38.900000-50.580000(44.716+/-3.9) -routing_sec:29.600000-33.560000(30.472+/-1.5) +routing_sec:45.080000-48.160000(46.814+/-1.1) -peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) +peer_write_all_sec:58.780000-87.150000(72.278+/-9.7) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	da845b660b	gossipd: gossip_store_get() to load a single store entry. This will allow us to load on demand, and not keep all messages in memory. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	1f08cfb3e3	gossipd: use file offset within store as broadcast index. Instead of an arbitrary counter, we can use the file offset for our partial ordering, removing a field. It takes some care when we compact the store, however, as this field changes. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34271-35283(34789.6+/-3.3e+02) vsz_kb:2288784 store_rewrite_sec:38.060000-39.130000(38.426+/-0.39) listnodes_sec:0.750000-0.850000(0.794+/-0.042) listchannels_sec:30.740000-31.760000(31.096+/-0.35) routing_sec:29.600000-33.560000(30.472+/-1.5) peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) MCP notable changes from previous patch (>1 stddev): -store_load_msec:35685-38538(37090.4+/-9.1e+02) +store_load_msec:34271-35283(34789.6+/-3.3e+02) -vsz_kb:2288768 +vsz_kb:2288784 -peer_write_all_sec:51.140000-58.350000(55.69+/-2.4) +peer_write_all_sec:49.220000-52.690000(50.892+/-1.3) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	ec50ec6a71	gossipd: make gossip loading stats accurate. They didn't count the header sizes when reporting bytes, which is misleading. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	eb4564c3cd	gossipd: embed broadcast information into each structure. This is more compact, but also required once we replace the arbitrary "index" with an actual offset into the gossip store. That will let us remove the in-memory variants entirely. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35685-38538(37090.4+/-9.1e+02) vsz_kb:2288768 store_rewrite_sec:35.530000-41.230000(37.904+/-2.3) listnodes_sec:0.720000-0.810000(0.762+/-0.041) listchannels_sec:30.750000-35.990000(32.704+/-2) routing_sec:29.570000-34.010000(31.374+/-1.8) peer_write_all_sec:51.140000-58.350000(55.69+/-2.4) MCP notable changes from previous patch (>1 stddev): -vsz_kb:2621808 +vsz_kb:2288768 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	62918fcb3b	gossip_store: avoid gratuitous copy on load. Doesn't make measurable difference, but an obvious optimization. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	617c23e735	gossipd: use u32 for timestamp. We used an s64 so we could use -1 and save a check, but that's just silly as we have adjacent non-u64 fields: wastes 7 bytes per node and 16 per channel. Interestingly, this seemed to make us a little slower for some reason. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:35569-38776(37169.8+/-1.2e+03) vsz_kb:2621808 store_rewrite_sec:35.870000-40.290000(38.14+/-1.6) listnodes_sec:0.740000-0.800000(0.768+/-0.023) listchannels_sec:29.820000-32.730000(30.972+/-0.99) routing_sec:30.110000-30.590000(30.346+/-0.18) peer_write_all_sec:52.420000-59.160000(54.692+/-2.5) MCP notable changes from previous patch (>1 stddev): -store_load_msec:32825-36365(34615.6+/-1.1e+03) +store_load_msec:35569-38776(37169.8+/-1.2e+03) -vsz_kb:2637488 +vsz_kb:2621808 -store_rewrite_sec:35.150000-36.200000(35.59+/-0.4) +store_rewrite_sec:35.870000-40.290000(38.14+/-1.6) -listnodes_sec:0.590000-0.710000(0.682+/-0.046) +listnodes_sec:0.740000-0.800000(0.768+/-0.023) -peer_write_all_sec:49.020000-52.890000(50.376+/-1.5) +peer_write_all_sec:52.420000-59.160000(54.692+/-2.5) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-11 18:31:34 -07:00
Rusty Russell	0b484b111e	gossipd: make more compact getchannels entries. We can save significant space by combining both sides: so much that we can reduce the WIRE_LEN_LIMIT to something sane again. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:34467-36764(35517.8+/-7.7e+02) vsz_kb:2637488 store_rewrite_sec:35.310000-36.580000(35.816+/-0.44) listnodes_sec:1.140000-2.780000(1.596+/-0.6) listchannels_sec:55.390000-58.110000(56.998+/-0.99) routing_sec:30.330000-30.920000(30.642+/-0.19) peer_write_all_sec:50.640000-53.360000(51.822+/-0.91) MCP notable changes from previous patch (>1 stddev): -store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) +store_rewrite_sec:35.310000-36.580000(35.816+/-0.44) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	91849dddc4	wire: use struct node_id for node ids. Don't turn them to/from pubkeys implicitly. This means nodeids in the store don't get converted, but bitcoin keys still do. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:33934-35251(34531.4+/-5e+02) vsz_kb:2637488 store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) listnodes_sec:1.020000-1.290000(1.146+/-0.086) listchannels_sec:51.110000-58.240000(54.826+/-2.5) routing_sec:30.000000-33.320000(30.726+/-1.3) peer_write_all_sec:50.370000-52.970000(51.646+/-1.1) MCP notable changes from previous patch (>1 stddev): -store_load_msec:46184-47474(46673.4+/-4.5e+02) +store_load_msec:33934-35251(34531.4+/-5e+02) -vsz_kb:2638880 +vsz_kb:2637488 -store_rewrite_sec:46.750000-48.280000(47.512+/-0.51) +store_rewrite_sec:34.720000-35.130000(34.94+/-0.14) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	a2fa699e0e	Use node_id everywhere for nodes. I tried to just do gossipd, but it was uncontainable, so this ended up being a complete sweep. We didn't get much space saving in gossipd, even though we should save 24 bytes per node. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	d4ab0592c5	fixup! gossipd: use simple inline array for nodes with few channels. Suggested-by: @cdecker Suggested-by: @niftynei	2019-04-09 12:37:16 -07:00
Rusty Russell	b6494c1994	gossipd: use simple inline array for nodes with few channels. Allocating a htable is overkill for most nodes; we can fit 11 pointers in the same space (10, since we use 1 to indicate we're using an array). MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:45947-47016(46683.4+/-4e+02) vsz_kb:2639240 store_rewrite_sec:46.950000-49.830000(48.048+/-0.95) listnodes_sec:1.090000-1.350000(1.196+/-0.095) listchannels_sec:48.960000-57.640000(53.358+/-2.8) routing_sec:29.990000-33.880000(31.088+/-1.4) peer_write_all_sec:49.360000-53.210000(51.338+/-1.4) MCP notable changes from previous patch (>1 stddev): - vsz_kb:2641316 + vsz_kb:2639240 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	417e1bab7d	gossipd: use iterator helpers for iterating node channels. Makes the next step easier. MCP results from 5 runs, min-max(mean +/- stddev): store_load_msec:45791-46917(46330.4+/-3.6e+02) vsz_kb:2641316 store_rewrite_sec:47.040000-48.720000(47.684+/-0.57) listnodes_sec:1.140000-1.340000(1.2+/-0.072) listchannels_sec:50.970000-54.250000(52.698+/-1.3) routing_sec:29.950000-31.010000(30.332+/-0.37) peer_write_all_sec:51.570000-52.970000(52.1+/-0.54) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-09 12:37:16 -07:00
Rusty Russell	891ee20a59	tools/bench-gossipd.sh: rough benchmark for gossipd and the million channels project Outputs CSV. We add some stats for load times in developer mode, so we can easily read them out. peer_read_all_sec doesn't work, since we seem to reject about half the updates for having bad signatures. It's also very slow... routing fails, for unknown reasons, so that failure is ignored in routing_sec. Results from 5 runs, min-max(mean +/- stddev): store_load_msec,vsz_kb,store_rewrite_sec,listnodes_sec,listchannels_sec,routing_sec,peer_write_all_sec 39275-44779(40466.8+/-2.2e+03),2899248,41.010000-44.970000(41.972+/-1.5),2.280000-2.350000(2.304+/-0.025),49.770000-63.390000(59.178+/-5),33.310000-34.260000(33.62+/-0.35),42.100000-44.080000(43.082+/-0.67) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Header from folded patch 'fixup!_tools-bench-gossipd.sh__rough_benchmark_for_gossipd_and_the_million_channels_project-2.patch': fixup! tools/bench-gossipd.sh: rough benchmark for gossipd and the million channels project Suggested-by: @niftynei Header from folded patch 'fixup!_tools-bench-gossipd.sh__rough_benchmark_for_gossipd_and_the_million_channels_project-1.patch': fixup! tools/bench-gossipd.sh: rough benchmark for gossipd and the million channels project MCP filename change. Header from folded patch 'tools-bench-gossipd.sh__dont_print_csv_by_default.patch': tools/bench-gossipd.sh: don't print CSV by default. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Header from folded patch 'fixup!_tools-bench-gossipd.sh__rough_benchmark_for_gossipd_and_the_million_channels_project.patch': fixup! tools/bench-gossipd.sh: rough benchmark for gossipd and the million channels project Make shellcheck happy. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	2bd7df93c6	gossipd: preserve unannounced channels across store compaction. Otherwise we'd forget them on restart, again. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	c424c42668	gossipd: store local channel updates across restart, even if unannounced. Either private or simply not enough confirms. They would have been added on reconnect, but that's not ideal. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	7c8f506a0f	dev-compact-store-gossip: specific RPC so we can test gossip_store rewrite. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	5b12007a4f	gossipd: dev option to allow unknown channels. This lets us benchmark without a valid blockchain. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Header from folded patch 'fixup!_gossipd__dev_option_to_allow_unknown_channels.patch': fixup! gossipd: dev option to allow unknown channels. Suggested-by: @cdecker Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	f8f6533dba	dev: --dev-gossip-time so gossipd doesn't prune old data. This is useful for canned data, such as the million channels project. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Rusty Russell	b2c93beaed	gossipd: use htable instead of simple array for node's channels. For giant nodes, it seems we spend a lot of time memmoving this array. Normally we'd go for a linked list, but that's actually hard: each channel has two nodes, so needs two embedded list pointers, and when iterating there's no good way to figure out which embedded pointer we'd be using. So we (ab)use htable; we don't really need an index, but it's good for cache-friendly iteration (our main operation). We can actually change to a hybrid later to avoid the extra allocation for small nodes. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-04-08 04:41:43 +00:00
Christian Decker	f3c234529e	gossip: Cache txout query failures If we asked `bitcoind` for a txout and it failed we were not storing that information anywhere, meaning that when we see the channel announcement the next time we'd be reaching out to `lightningd` and `bitcoind` again, just to see it fail again. This adds an in-memory cache for these failures so we can just ignore these the next time around. Fixes #2503 Signed-off-by: Christian Decker <decker.christian@gmail.com>	2019-04-01 23:54:19 +00:00
Christian Decker	426b22fdcb	gossip: Bump `gossip_getnodes_reply` result count to be u32 as well Otherwise we'll just have the same issue once we reach 65k nodes. Signed-off-by: Christian Decker <decker.christian@gmail.com>	2019-03-27 12:48:52 +01:00
Christian Decker	25e829c7d1	gossip: Make the `listchannels` reply result count a u32 Fixes #2504 Signed-off-by: Christian Decker <decker.christian@gmail.com> Reported-by: Antoine Le Calvez <@alecalve>	2019-03-27 12:48:52 +01:00
Rusty Russell	00f3a84af2	test: fix thinko in gossipd/test/run-bench-find_route.c Reported-by: @cdecker Fixes: #2440 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-03-05 11:42:43 +01:00
Rusty Russell	38e7d19dd5	Makefile: check for direct amount_sat/amount_msat access. We need to do it in various places, but we shouldn't do it lightly: the primitives are there to help us get overflow handling correct. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	28f5da7b2f	tools/generate-wire: use amount_msat / amount_sat for peer protocol. Basically we tell it that every field ending in '_msat' is a struct amount_msat, and 'satoshis' is an amount_sat. The exceptions are channel_update's fee_base_msat which is a u32, and final_incorrect_htlc_amount's incoming_htlc_amt which is also a 'struct amount_msat'. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	3ac0e814d0	daemons: use amount_msat/amount_sat in all internal wire transfers. As a side-effect of using amount_msat in gossipd/routing.c, we explicitly handle overflows and don't need to pre-prune ridiculous-fee channels. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	85b8b25749	bitcoin/chainparams: use amount_sat / amount_msat Simple changes, but ripples through the code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 08:01:37 +00:00
Rusty Russell	83adb94583	lightningd and routing: use struct amount_msat. We use it in route_hop, and paper over it in the JSON APIs. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 03:44:44 +00:00
Rusty Russell	7fad7bccba	common/amount: new types struct amount_msat and struct amount_sat. They're generally used pass-by-copy (unusual for C structs, but convenient they're basically u64) and all possibly problematic operations return WARN_UNUSED_RESULT bool to make you handle the over/underflow cases. The new #include in json.h means bolt11.c sees the amount.h definition of MSAT_PER_BTC, so delete its local version. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-21 00:44:57 +00:00
Michael Schmoock	302a78f4eb	fix: add inline exception for recent cppcheck false positive	2019-02-18 01:06:01 +00:00
Rusty Russell	b99293fbb6	short_channel_id: don't accept :-separated in JSON if --allow-deprecated-apis=false We need to still accept it when parsing the database, but this flag should allow upgrade testing for devs building on top Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-08 16:52:30 -08:00
Rusty Russell	3ae0c20026	getroute: change definition (and pay default) for riskfactor. Up until now, riskfactor was useless due to implementation bugs, and also the default setting is wrong (too low to have an effect on reasonable payment scenarios). Let's simplify the definition (by assuming that P(failure) of a node is 1), to make it a simple percentage. I examined the current network fees to see what would work, and under this definition, a default of 10 seems reasonable (equivalent to 1000 under the old definition). It is this change which finally fixes our test case! The riskfactor is now 40msat (1500000 * 14 * 10 / 5259600 = 39.9), comparable with the worst-case fuzz of 50msat (1001 * 0.05 = 50). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	05f95b59c1	gossipd: take into account risk in final route comparison. We were only comparing by total msatoshis. Note, this still isn't sufficient to fix our indirect problem, as our risk values are all 1 (the minimum): lightning_gossipd(25480): 2 hop solution: 1501990 + 2 lightning_gossipd(25480): 3 hop solution: 1501971 + 3 ... lightning_gossipd(25480): => chose 3 hop solution Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	662bb0c565	gossipd: fix riskfactor passing. We used a u16, and a 1000 multiplier, which meant we wrapped at riskfactor 66. We also never undid the multiplier, so we ended up applying 1000x the riskfactor they specified. This changes us to pass the riskfactor with a 1M multiplier. The next patch changes the definition of riskfactor to be more useful. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	6a26b0c18d	gossipd: increase randomness in route selection. We have a seed, which is for (future!) unit testing consistency. This makes it change every time, so our pay_direct_test is more useful. I tried restarting the node around the loop, but it tended to fail rebinding to the same port for some reason? Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-02-06 18:39:52 +01:00
Rusty Russell	afab1f7b3c	gossipd: handle onion errors internally. As a general rule, lightningd shouldn't parse user packets. We move the parsing into gossipd, and have it respond only to permanent failures. Note that we should not unconditionally remove a channel on WIRE_INVALID_ONION_HMAC, as this can be triggered (and we do!) by feeding sendpay a route with an incorrect pubkey. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-23 22:08:08 +01:00
Rusty Russell	4eddf57fd9	gossipd: don't mark channels unroutable. For transient failures, the pay plugin should simply exclude those from route considerations. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-23 22:08:08 +01:00
Rusty Russell	018a3f1d58	short_channel_id: make mk_short_channel_id return a failure. We had a bug `0ba547ee10` caused by short_channel_id overflow. If we'd caught this, we'd have terminated the peer instead of crashing, so add appropriate checks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-21 12:31:06 +01:00
Rusty Russell	e2777642c0	getroute: add direction to route returned. We also ignore it in sendpay. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-17 13:02:24 +01:00
Rusty Russell	0ba547ee10	gossipd: handle overflowing query properly (avoid slow 100% CPU reports) Don't do this: (gdb) bt #0 0x00007f37ae667c40 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #1 0x00007f37ae668b38 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #2 0x00007f37ae669907 in deflate () from /lib/x86_64-linux-gnu/libz.so.1 #3 0x00007f37ae674c65 in compress2 () from /lib/x86_64-linux-gnu/libz.so.1 #4 0x000000000040cfe3 in zencode_scids (ctx=0xc1f118, scids=0x2599bc49 "\a\325{", len=176320) at gossipd/gossipd.c:218 #5 0x000000000040d0b3 in encode_short_channel_ids_end (encoded=0x7fff8f98d9f0, max_bytes=65490) at gossipd/gossipd.c:236 #6 0x000000000040dd28 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=8) at gossipd/gossipd.c:576 #7 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=16) at gossipd/gossipd.c:595 #8 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=32) at gossipd/gossipd.c:596 #9 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=64) at gossipd/gossipd.c:595 #10 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=128) at gossipd/gossipd.c:596 #11 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=256) at gossipd/gossipd.c:595 #12 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=512) at gossipd/gossipd.c:595 #13 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=1024) at gossipd/gossipd.c:595 #14 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2047) at gossipd/gossipd.c:596 #15 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4095) at gossipd/gossipd.c:595 
#16 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8191) at gossipd/gossipd.c:595 #17 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16382) at gossipd/gossipd.c:595 #18 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=32764) at gossipd/gossipd.c:595 #19 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=65528) at gossipd/gossipd.c:595 #20 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=131056) at gossipd/gossipd.c:595 #21 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=262112) at gossipd/gossipd.c:595 #22 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=524225) at gossipd/gossipd.c:595 #23 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=1048450) at gossipd/gossipd.c:595 #24 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2096900) at gossipd/gossipd.c:595 #25 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4193801) at gossipd/gossipd.c:595 #26 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8387603) at gossipd/gossipd.c:595 #27 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16775207) at gossipd/gossipd.c:595 #28 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=33550414) at gossipd/gossipd.c:596 #29 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=67100829) at gossipd/gossipd.c:595 #30 0x000000000040ddc6 in queue_channel_ranges 
(peer=0x3868fc8, first_blocknum=514201, number_of_blocks=134201659) at gossipd/gossipd.c:595 #31 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=268403318) at gossipd/gossipd.c:595 #32 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=536806636) at gossipd/gossipd.c:595 #33 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=1073613273) at gossipd/gossipd.c:595 #34 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=2147226547) at gossipd/gossipd.c:595 #35 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=4294453094) at gossipd/gossipd.c:595 #36 0x000000000040df26 in handle_query_channel_range (peer=0x3868fc8, msg=0x37e0678 "\001\ao\342\214\n\266\361\263r\301\246\242F\256c\367O\223\036\203e\341Z\b\234h\326\031") at gossipd/gossipd.c:625 The cause was that converting a block number to an scid truncates it at 24 bits. When we look through the index from (truncated number) to (real end number) we get every channel, which is too large to encode, so we iterate again. This fixes both that problem, and also the issue that we'd end up dividing into many empty sections until we get to the highest block number. Instead, we just tack the empty blocks on to the end of the final query. (My initial version requested 0xFFFFFFFE blocks, but the dev code which records what blocks were returned can't make a bitmap that big on 32 bit). Reported-by: George Vaccaro Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 11:34:45 -08:00
Rusty Russell	9f1f79587e	short_channel_id_dir: new primitive for one direction of short_channel_id Currently only used by gossipd for channel elimination. Also print them in canonical form (/[01]), so tests need to be changed. Suggested-by: @cdecker Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	80753bfbd5	Feedback from @niftynei. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	dc2ee9639b	listchannels: allow source arg to list channels by their source node. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	358b7fda91	getroute: allow caller to specify maximum hops. This is required for routeboost. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	599ec5efbe	gossipd: allow an array of excluded channels for getroute_request. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	be64dd84ca	waitsendpay: indicate which channel direction the error was. You can figure this yourself by knowing the route, but it's better to report it directly here. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	c0cfddfa95	test/run-bench-find_route: fix so it runs properly. We didn't populate the channels properly so it always failed. Additionally, somewhere along the line we kept using the single scid so we only created one channel. Also, the next patch will start comparing the pubkeys, so make valid ones: use an array so we don't affect the benchmark too much. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	1567238dd9	invoice: option to expose/not-expose private channels. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	fe4a600bc7	routeboost: don't use channels to dead-end nodes. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	547d6ab878	routeboost: expose private channel in invoice iff we have no public ones. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	f321b1d35f	getroute: remove seed arg, document fromid, make default fuzzpercent match docs. seed isn't very useful at this level: I've left it in routing.c because it might be useful for detailed testing. Pretty sure it's unused, so I simply removed it. The fuzzpercent is documented to default at 5%, but actually was 75%. Fix that too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Rusty Russell	26dda57cc0	utils: make tal_arr_expand safer. Christian and I both unwittingly used it in the form: tal_arr_expand(&x) = tal(x, ...) Since '=' isn't a sequence point, the compiler can (and does!) cache the value of x, handing it to tal after tal_arr_expand() moves it due to tal_resize(). The new version is somewhat less convenient to use, but doesn't have this problem, since the assignment is always evaluated after the resize. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-01-15 12:01:38 +01:00
Christian Decker	659a26ea5a	misc: Update short_channel_id representation to use 'x' separators Reported-by: Alex Bosworth <@alexbosworth> Signed-off-by: Christian Decker <decker.christian@gmail.com>	2019-01-15 03:50:27 +00:00
Christian Decker	94eb2620dc	bolt: Updated the BOLT specification to the latest version This is mainly just copying over the copy-editing from the lightning-rfc repository. [ Split to just perform changes after the UNKNOWN_PAYMENT_HASH change --RR ] Signed-off-by: Christian Decker <decker.christian@gmail.com> Reported-by: Rusty Russell <@rustyrussell>	2019-01-15 02:19:56 +00:00
Christian Decker	65054ae72e	bolt: Updated the BOLT specification to a07dc3df3b4611989e3359f28f96c574f7822850 This is mainly just copying over the copy-editing from the lightning-rfc repository. [ Split to just perform changes prior to the UNKNOWN_PAYMENT_HASH change --RR ] Signed-off-by: Christian Decker <decker.christian@gmail.com> Reported-by: Rusty Russell <@rustyrussell>	2019-01-15 02:19:56 +00:00
Rusty Russell	23540fe956	common: make funding_tx and withdraw_tx share UTXO code. They both do the same thing: convert utxos into tx inputs. Share code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-12-06 23:11:51 +01:00
Rusty Russell	ab735dcbe6	gossipd: wire up memleak detection. For simplicity we dump leaks to logs, and just return a bool to master. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-22 05:15:42 +00:00
Rusty Russell	78771ca371	gossipd: mark timers as not being leaks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-22 05:15:42 +00:00
Rusty Russell	5a81dbd783	common/daemon: enable/cleanup memleak in daemon_setup / daemon_shutdown. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-22 05:15:42 +00:00
Rusty Russell	29b672b117	gossipd: hear no wumbo. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 21:43:37 +00:00
Rusty Russell	9620393109	gossipd: store chainparams internally. We keep a chain_hash in struct daemon, because otherwise we end up with `&peer->daemon->rstate->chainparams->genesis_blockhash` which is a bit ridiculous. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 21:43:37 +00:00
Rusty Russell	5312ec1e34	gossipd: add documentation comments now it's relatively understandable. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	ea2c03e2e2	gossipd: don't have code to exit final loop; we always leave via master_gone. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	4038061d0f	gossipd: use take() in getroute_req. Trivial optimization. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	5c60d7ffb2	gossipd: split wire types into msgs from lightningd and msgs from per-peer daemons This avoids some very ugly switch() statements which mixed the two, but we also take the chance to rename 'towire_gossip_' to 'towire_gossipd_' for those inter-daemon messages; they're messages to gossipd, not gossip messages. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	07b16e37d0	daemon_conn: don't rely on outq_empty callback telling us to retry queue. We had at least one bug caused by it not returning true when it had queued something. Instead, just re-check the queue after it's called. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	4e9eba1965	gossipd: rework query_channel_range to accept overlapping range. We shouldn't insist on an exact response match: they can batch it and send a whole batch, as long as it overlaps what we ask. We also change to a bitmap to save some memory. This isn't noted in the CHANGELOG since we don't actually send gossip range queries except for testing. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	363564301f	gossipd: be more rigorous in handling peer messages vs. daemon requests. Messages from a peer may be invalid in many ways: we send an error packet in that case. Rather than internally calling peer_error, however, we make it explicit by having the handle_ functions return NULL or an error packet. Messages from the daemon itself should not be invalid: we log an error and close the fd to them if it is. Previously we logged an error but didn't kill them. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Rusty Russell	1bd76861fd	gossipd: reorder functions into related groups (MOVEONLY) It's MOVEONLY but for the removal of the '#ifndef TESTING' which was needed for old test code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-11-21 00:36:31 +00:00
Christian Decker	8e83d43c39	opts: Split early from non-early args so plugins can register theirs The idea is that `plugin` is an early arg that is parsed (from command line or the config file). We can then start the plugins and have them tell us about the options they'd like to add to the mix, before we actually parse them. Signed-off-by: Christian Decker <@cdecker>	2018-11-13 00:44:50 +01:00
Rusty Russell	3c97f3954e	daemon_conn: make it a tal object, typesafe callbacks. It means an extra allocation at startup, but it means we can hide the definition, and use standard patterns (new_daemon_conn and typesafe callbacks). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	0e6aec081a	gossipd: make sure that freeing peer closes connection to it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	689d51cba5	common/daemon_conn: remove finished function. For the moment, caller sets it manually. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-29 04:06:16 +00:00
Rusty Russell	c236361efd	wireaddr: update bolt version, remove 'padding' from addresses. Nobody used this, so it was removed from the spec. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-28 23:51:05 +00:00
Rusty Russell	66dcba099d	gossipd: hand raw pubkeys in getnodes and getchannels entries. We spend quite a bit of time in libsecp256k1 moving them to and from DER encoding. With a bit of care, we can transfer the raw bytes from gossipd and manually decode them so a malformed one can't make us abort(). Before: real 0m0.629000-0.695000(0.64985+/-0.019)s After: real 0m0.359000-0.433000(0.37645+/-0.023)s At this point, the main issues are 11% of time spent in ccan/io's backend_wake (I tried using a hash table there, but that actually makes the small-number-of-fds case slower), and 65% of gossipd's time is in marshalling the response (all those tal_resize add up!). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-19 22:02:11 +00:00
Rusty Russell	bbc36a7bec	gossipd: update node announcement even if we change within a second. Usually Travis triggers corner cases because it's so slow, but this time the moons aligned, and it managed to fail test_node_reannounce because it generated the updated node_announcement with the same timestamp as the old one. This is because we only updated "last_announce_timestamp" when we generated the announcement, not when we got it off the wire or loaded it from the gossip store. The fix is to ask the routing code what the latest timestamp is; we could still generate a clashing timestamp if (1) the gossip store is lost, and (2) we restart within one second. Hard to care. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-16 04:24:03 +00:00
lisa neigut	0ae1d03513	BOLT7: broadcast `htlc_maximum_msat` in `channel_update`s Have c-lightning nodes send out the largest value for `htlc_maximum_msat` that makes sense, ie the lesser of the peer's max_inflight_htlc value or the total channel capacity minus the total channel reserve.	2018-10-16 03:32:27 +00:00
Rusty Russell	afac01380d	gossipd: don't initialize broadcast interval, make field name explicit. We initialize it to 30 seconds, but it's always overridden by the gossip_init message (and usually to 60 seconds, so it's doubly misleading). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00
Rusty Russell	3991425111	gossipd: don't accept forwarding short_channel_ids we don't own. Gossipd provided a generic "get endpoints of this scid" and we only use it in one place: to look up htlc forwards. But lightningd just assumed that one would be us. Instead, provide a simpler API which only returns the peer node if any, and now we handle it much more gracefully. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00
Rusty Russell	030fe1ce53	gossipd: don't expose private channels for routeboost. We don't create unannouncable channels, but other implementations can. Not only is it rude to expose these via invoices, it's probably not useable anyway. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-15 23:04:17 +00:00
lisa neigut	762c795c9b	gossip: reject channel_update with invalid `htlc_max_msat` If the channel update signals an invalid `htlc_maximum_msat` value, we ignore the update.	2018-10-09 23:22:52 +00:00
lisa neigut	1b6bd3fded	wire: add test for parsing optional version of channel_update	2018-10-09 23:22:52 +00:00
lisa neigut	a289282bad	gossipd: use u64 for `htlc_minimum_msat` field It's u64 in the spec, so we should use u64 too.	2018-10-09 23:22:52 +00:00
lisa neigut	b9331e5ac8	gossipd: parse and respect optional `htlc_maximum_msat` If another channel has set the optional `htlc_maximum_msat` field, we should correctly parse that field and respect it when drawing up routes for payments.	2018-10-09 23:22:52 +00:00
Rusty Russell	de37586a97	gossipd: use riskfactor in getroute, not "1". AFAICT, this was there in the original commit by @cdecker. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-09 08:40:52 +00:00
Rusty Russell	d946e965a6	gossipd: test that fromwire from lightningd messages succeeds. Also tiny drive-by cleanup for gossip_disable_local_channels to modern form. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-09 08:40:52 +00:00
Rusty Russell	864812019f	gossipd: use tal_arr_expand instead of open-coding it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-09 08:40:52 +00:00
Rusty Russell	915ffe35ed	gossipd: clean up getnodes handling. globalfeatures should not be accessed if we haven't received a channel_update. Treat it like the other fields which are only initialized and marshalled/unmarshalled if the timestamp is positive. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-09 08:40:52 +00:00
Rusty Russell	df27fc55af	More renaming of gfeatures to globalfeatures. Use the BOLT #1 naming. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-10-09 08:40:52 +00:00
Rusty Russell	bb5e2ffafb	gossipd: don't create redundant node_announcements. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 18:20:17 +02:00
Rusty Russell	afc92dd757	gossipd: use array[32] not pointer for alias. And use ARRAY_SIZE() everywhere which will break compile if it's not a literal array, plus assertions that it's the same length. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 18:20:17 +02:00
Rusty Russell	0baa5f7071	gossipd: send node announcement on startup. I suspect this fixes #1660 too, but checking would be good. Fixes: #1781 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 18:20:17 +02:00
Rusty Russell	2f667c5227	gossipd: routine to get route_info for known incoming channels. For routeboost, we want to select from all our enabled channels with sufficient incoming capacity. Gossipd knows which are enabled (ie. we have received a `channel_update` from the peer), but doesn't know the current incoming capacity. So we get gossipd to give us all the candidates, and lightningd selects from those. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 15:03:42 +02:00
Rusty Russell	f64eee717d	gossipd: make helpers const-correct. Always be const if you can. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 15:03:42 +02:00
Rusty Russell	95c9a73fbb	gossipd: set sent flag when sending reply_short_channel_ids_end Otherwise, if we don't announce the last node, we'll not flush this out; it will be delayed until the next time we send gossip! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 14:39:25 +02:00
Rusty Russell	fbb7bafc3b	gossipd: don't include channel in query_short_channel_ids reply if no channel_update. This is consistent: we don't broadcast a channel_announce until we've seen a channel_update, so we probably shouldn't advertise it here. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 14:39:25 +02:00
Rusty Russell	41b0872f58	Use localfeatures and globalfeatures consistently. That's what BOLT #1 calls them; make it easier for people to grep. Reported-by: @niftynei Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-28 04:14:28 +00:00
Rusty Russell	96f05549b2	common/utils.h: add tal_arr_expand helper. We do this a lot, and had boutique helpers in various places. So add a more generic one; for convenience it returns a pointer to the new end element. I prefer the name tal_arr_expand to tal_arr_append, since it's up to the caller to populate the new array entry. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-27 22:57:19 +02:00
Rusty Russell	e450c6bbdb	gossipd: remove time-delayed local channel_update, produce DISABLE on-demand. We have a lot of infrastructure to delay local channel_updates to avoid spamming on each peer reconnect; we had to keep track of pending ones though, in case we needed the very latest for sending an error when failing an HTLC. Instead, it's far simpler to set the local_disabled flag on a channel when we disconnect, but only send a disabling channel_update if we actually fail an HTLC. Note: handle_channel_update() TAKES update (due to tal_arr_dup), but we didn't use that before. Now we do, add annotation. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-26 03:21:35 +00:00
Rusty Russell	16e16a725e	gossipd: apply private updates to announce channel. We trade channel_update before channel_announce makes the channel public, and currently forget them when we finally get the channel_announce. We should instead apply them, and not rely on retransmission (which we remove in the next patch!). This earlier channel_update means test_gossip_jsonrpc triggers too early, so have that wait for node_announcement. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-26 03:21:35 +00:00
Rusty Russell	66105e83ea	gossipd: simplify "broadcast channel_announcement now we have channel_update" logic It's simpler and more robust to just check that it's not yet announced (the broadcast index will be 0). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-26 03:21:35 +00:00
Rusty Russell	8455b12781	Revert "gossipd: handle premature node_announcements in the store." This reverts commit `e2f426903d`. With the new store version, this can't happen. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-21 17:56:15 +02:00
Rusty Russell	48de77d56e	gossipd: invalidate old gossip_stores. Incrementing version number means stores which were prior to the previous commit will be removed, and refreshed. The simplest fix, if not the most efficient. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-21 17:56:15 +02:00
lisa neigut	b1ceaf9910	gossipd: Update BOLT-split flags in channel_update BOLT 7's been updated to split the flags field in `channel_update` into two: `channel_flags` and `message_flags`. This changeset does the minimal necessary to get to building with the new flags.	2018-09-21 00:24:12 +00:00
Rusty Russell	e012e94ab2	hsmd: rename hsm_client_wire_csv to hsm_wire.csv That matches the other CSV names (HSM was the first, so it was written before the pattern emerged). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-20 09:49:39 +02:00
Rusty Russell	8f1f1784b3	hsmd: remove hsmd/client.c It was only used by handshake.c. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-20 09:49:39 +02:00
Rusty Russell	704d30edce	ping: complete JSON RPC ping commands even if one ping gets no response. We would never complete further ping commands if we had < responses than pings. Oops. Fixes: #1928 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-14 22:11:23 +02:00
Rusty Russell	97c7ba2f80	gossipd: fix reordering of node_announcements in presence of an unannounced channel. If we receive a channel_announce but not a channel_update, we store the announce but don't put it in the broadcast map. When we delete a channel, we check if the node_announcement broadcast now precedes all channel_announcements, and if so, we move it to the end of the map. However, with a channel_announcement at index '0', this test fails. This is at least one potential cause of the node map getting out of order. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-04 14:36:05 +02:00
Rusty Russell	e2f426903d	gossipd: handle premature node_announcements in the store. These happen after we compact the store; every log I've seen of a restart on a real node has a message about truncating the store, because node_announcements predate channel_announcements. I extracted one such case from testnet, and reduced it to test here. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-04 14:36:05 +02:00
Rusty Russell	0d46a3d6b0	Put the 'd' back in the daemons. @renepickhardt: why is it actually lightningd.c with a d but hsm.c without d ? And delete unused gossipd/gossip.h. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-03 05:01:40 +00:00
Rusty Russell	317a830e94	devtools: dump-gossipstore. Not very useful by itself, but when combined with decodemsg it can tell us quite a bit. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-09-03 00:39:06 +00:00
Rusty Russell	f80955c932	broadcast: don't leak in broadcast_del. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	5d1f71c3c0	gossipd: don't leak fields in create_node_announcement. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	a475098928	gossipd: fix leak in gossip_store_add_channel_delete. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	1c81486b48	routing: fix falsely flagged leak. pending goes away on a timer, sure, but might as well use tmpctx here. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	b10bae1ceb	gossipd: use ctx arg in create_channel_update. Turns out it was always `tmpctx` anyway, so this isn't a real bug right now. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	2db77f5d1d	gossipd: minor modifications for memleak detection to work. 1. Move the list to the start of `struct peer`: memleak walks the list correctly this way. 2. Don't create tal parent loop daemon->conn->daemon. The second one is silly anyway: we exit via master_gone when the master conn is closed. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-24 19:54:32 +02:00
Rusty Russell	83eadb3548	gossipd: fix SUPERVERBOSE usage, enhance, when turned on. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-23 14:46:22 +02:00
Rusty Russell	74521b3fb7	gossipd: don't delay the very first channel_update. Lightning charge tests stopped working without a timeout, being unable to find a route. The 15 second delay doesn't matter in real life, but in these scenarios it does. This fixes it by making sure the channel is usable immediately. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-21 00:49:12 +02:00
conanoc	b1900b18ab	Fix DEVELOPER guard for ping ping_req() should be outside of DEVELOPER guard now.	2018-08-15 06:48:55 +00:00
Christian Decker	6627da5eb5	routing: Do not consider risk when capping transfers Reported-by: Rusty Russell <@rustyrussell> Signed-off-by: Christian Decker <@cdecker>	2018-08-06 22:46:02 +02:00
Christian Decker	84905eac2b	routing: Make the capacity a parameter to new_chan As pointed out by @rustyrussell the capacity is now always defined, so we can fold that into the construction of the channel itself. Reported-by: Rusty Russell <@rustyrussell> Signed-off-by: Christian Decker <@cdecker>	2018-08-06 22:46:02 +02:00
Christian Decker	8201764117	routing: Skip channels that require larger HTLCs than we are routing The `htlc_minimum_msat` parameter was ignored so far, and we'd be attempting to pay and hitting a brick wall by doing so. This patch just skips channels that are not eligible anyway.	2018-08-06 22:46:02 +02:00
Christian Decker	14000a22bc	routing: Skip channels that don't have sufficient capacity We know the total channel capacity after checking for its existence on-chain, so we can actually make use of that information to discard channels that don't have a sufficient capacity anyway, reducing the number of failed attempts.	2018-08-06 22:46:02 +02:00
Christian Decker	8a34933c1a	gossip: Annotate locally added channels with their capacity We were adding channels without their capacity, and eventually annotated them when we exchanged `channel_update`s. This worked as long as we weren't considering the channel capacity, but would result in local-only channels to be unusable once we start checking.	2018-08-06 22:46:02 +02:00
Rusty Russell	584ee26200	gossipd: fix thinko in node_announcement address parsing which made us miss final address 'cursor < ser + max' isn't valid because we reduce 'max' as we go! Effectively we'll stop once we're past halfway, which can only happen with ipv6 + a torv2 address. The fix is one-line, but we rename 'max' to 'len' which makes its purpose clearer. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-06 19:33:46 +02:00
Rusty Russell	0b08601951	sync_crypto_write/sync_crypto_read: just fail, don't return NULL. There's only one thing the caller ever does, just do that internally. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-08-05 02:03:58 +00:00
practicalswift	7969cc335e	Allocate off ctx instead of tmpctx in encode_short_channel_ids_start(const tal_t *ctx)	2018-08-01 13:09:16 +09:30
practicalswift	b5682a773b	Remove dead stores	2018-07-31 12:45:02 +02:00
Rusty Russell	5cf34d6618	Remove tal_len, use tal_count() or tal_bytelen(). tal_count() is used where there's a type, even if it's char or u8, and tal_bytelen() is going to replace tal_len() for clarity: it's only needed where a pointer is void. We shim tal_bytelen() for now. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-07-30 11:31:17 +02:00
Rusty Russell	36730ddb6d	gossipd: dev-suppress-gossip. Useful for testing that we only get an update via the error message. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-07-27 14:12:00 +02:00
Rusty Russell	73b3782943	gossipd: send latest update in error message, even if delayed. We delay internally to reduce broadcasting route flap, but errors are a special case: we want to send the latest, otherwise we might send an old (non-disabled) update. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-07-27 14:12:00 +02:00
Rusty Russell	3c66d5fa03	gossipd: add flag for locally disabling channel. We used to just manually set ROUTING_FLAGS_DISABLED, but that means we then suppressed the real channel_update because we thought it was a duplicate! So use a local flag: set it for the channel when the peer disconnects, and clear it when channeld sends a local update. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-07-27 14:12:00 +02:00
Rusty Russell	d241bd762c	connectd: don't use gossip_getnodes_entry. gossip_getnodes_entry was used by gossipd for reporting nodes, and for reporting peers. But the local_features field is only available for peers, and most other fields are only available from node_announcement. Note that the connectd change actually means we get less information about peers: gossipd used to do the node lookup for peers and include the node_announcement information if it had it. Since generate_wire.py can't create arrays-of-arrays, we add a 'struct peer_features' to encapsulate the two feature arrays for each peer, and for convenience we add it to lightningd/gossip_msg. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2018-07-25 02:13:52 +00:00
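
Note on commit 0ba547ee10 above: the message says that converting a block number to an scid truncates it at 24 bits. The sketch below illustrates that effect only; it is not the gossipd code. It assumes the BOLT 7 short_channel_id layout (3 bytes block height, 3 bytes transaction index, 2 bytes output index), and the helper names scid_from_parts, scid_blocknum and clamp_blocknum are hypothetical, chosen for this illustration.

```c
#include <assert.h>
#include <stdint.h>

#define SCID_FIELD_MASK 0xFFFFFFu /* 24 bits, per the assumed BOLT 7 layout */

/* Hypothetical helper: pack the three parts into a 64-bit scid.
 * Only the low 24 bits of blocknum survive; higher bits are dropped,
 * which is the truncation the commit message describes. */
static uint64_t scid_from_parts(uint32_t blocknum, uint32_t txnum,
				uint16_t outnum)
{
	assert(txnum <= SCID_FIELD_MASK);
	return ((uint64_t)(blocknum & SCID_FIELD_MASK) << 40)
		| ((uint64_t)txnum << 16)
		| outnum;
}

/* Hypothetical helper: recover the block number stored in an scid. */
static uint32_t scid_blocknum(uint64_t scid)
{
	return (uint32_t)(scid >> 40) & SCID_FIELD_MASK;
}

/* Hypothetical guard: clamp a block number to the largest value an
 * scid can represent, so a range end can never wrap around. */
static uint32_t clamp_blocknum(uint64_t blocknum)
{
	return blocknum > SCID_FIELD_MASK ? SCID_FIELD_MASK
					  : (uint32_t)blocknum;
}
```

With these helpers, a first_blocknum of 17289408 (a value from the backtrace, above 2^24) comes back from scid_blocknum(scid_from_parts(17289408, 0, 0)) as 512192, which is how a range start can land far below the requested block and sweep in every channel.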

679 Commits