The expected test output is with the headers decompressed, which
we can't do without Nghttp2. (It outputs the compressed headers
if we don't have it, so we could test for that instead.)
Fix #18707
Extends raw addressing syntax to work with references. The syntax
is
@field1 == ${@field2}
This requires replicating the logic to load field references, but
using raw values instead. We use separate hash tables for that,
namely "references" vs "raw_references".
This adds new syntax to read a field from the tree as bytes, instead
of the actual type. This is a useful extension, for example to match
malformed strings that contain Unicode replacement characters. In
that case it is not possible to match the original raw value of the
malformed string field. This extension fills that need and is generic
enough that it should be useful in many other situations.
The syntax used is to prefix the field name with "@". The following
artificial example tests if the HTTP user agent contains a particular
invalid UTF-8 sequence:
@http.user_agent == "Mozill\xAA"
Simply using "http.user_agent" won't work here because the invalid byte
sequence will have been replaced with U+FFFD.
Consider the following programs:
$ dftest '_ws.ftypes.string == "ABC"'
Filter: _ws.ftypes.string == "ABC"
Syntax tree:
0 TEST_ANY_EQ:
1 FIELD(_ws.ftypes.string <FT_STRING>)
1 FVALUE("ABC" <FT_STRING>)
Instructions:
00000 READ_TREE _ws.ftypes.string <FT_STRING> -> reg#0
00001 IF_FALSE_GOTO 3
00002 ANY_EQ reg#0 == "ABC" <FT_STRING>
00003 RETURN
$ dftest '@_ws.ftypes.string == "ABC"'
Filter: @_ws.ftypes.string == "ABC"
Syntax tree:
0 TEST_ANY_EQ:
1 FIELD(_ws.ftypes.string <RAW>)
1 FVALUE(41:42:43 <FT_BYTES>)
Instructions:
00000 READ_TREE @_ws.ftypes.string <FT_BYTES> -> reg#0
00001 IF_FALSE_GOTO 3
00002 ANY_EQ reg#0 == 41:42:43 <FT_BYTES>
00003 RETURN
In the second case the field has a "raw" type, which maps directly to
FT_BYTES, and the field value is read from the raw protocol data.
The poll and precision fields in NTP timing messages are signed
integers.
Different NTP implementations have different minimum and maximum polling
intervals. Some can be configured even with negative values for
sub-second intervals (e.g. down to -7 for 1/128th of a second).
NTP clocks on modern systems and hardware typically have
a sub-microsecond precision.
Print all poll values. Add the raw precision and change the resolution
of the printed value to nanoseconds.
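As a reminder of how these signed exponents map to time values (both
fields are base-2 exponents of seconds per RFC 5905), here is a minimal
standalone sketch, not dissector code:
```c
#include <math.h>
#include <stdint.h>
#include <stdio.h>

/* Convert an NTP poll/precision exponent to seconds: the field is an
 * 8-bit two's-complement exponent of two, so negative values are
 * sub-second (e.g. -7 -> 1/128 s, -20 -> ~0.95 us). */
static double ntp_log2s_to_seconds(int8_t exponent)
{
    return ldexp(1.0, exponent); /* 2^exponent seconds */
}

int main(void)
{
    printf("poll      -7  -> %.9f s\n", ntp_log2s_to_seconds(-7));
    printf("precision -20 -> %.9f s\n", ntp_log2s_to_seconds(-20));
    return 0;
}
```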
The proto.h APIs expect valid UTF-8 so replace uses of format_text()
with a label copy function that just does formatting and does not
check for encoding errors. Avoid multiple levels of temporary
string allocations.
Make sure the copy does not truncate a multibyte character and
produce invalid strings. Add debug checks for UTF-8 encoding errors
instead.
We escape C0 and C1 control codes (because they are control codes)
and ASCII whitespace (and bell).
Overall the goal is to be more efficient and to help detect misuse of
the APIs by callers passing invalid UTF-8.
Add a unit test for ws_label_strcat.
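A minimal sketch of the truncation-safety idea (illustrative only, not
the actual ws_label_strcat implementation): when a copy into a
fixed-size buffer must stop early, back up over UTF-8 continuation
bytes so the output never ends in the middle of a multibyte sequence.
```c
#include <stddef.h>
#include <string.h>

/* Copy src into dst (dst_size bytes) without splitting a UTF-8 sequence
 * at the truncation point. Continuation bytes are 0x80..0xBF. */
static size_t copy_utf8_truncated(char *dst, size_t dst_size, const char *src)
{
    size_t len;

    if (dst_size == 0)
        return 0;
    len = strlen(src);
    if (len >= dst_size)
        len = dst_size - 1;                       /* leave room for NUL */
    while (len > 0 && ((unsigned char)src[len] & 0xC0) == 0x80)
        len--;                                    /* back up to a boundary */
    memcpy(dst, src, len);
    dst[len] = '\0';
    return len;
}
```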
This removes the last dependency of the logging subsystem on the
preferences module. The latter is started much later than the former,
which is an issue.
The Windows-only preference "gui.console_open" is stored in the
registry as HKEY_CURRENT_USER\Software\Wireshark\ConsoleOpen. The
semantics are exactly the same. The preference is read by the logging
subsystem for initialization and then again by the preferences module
(read/write) so the user can configure it as before.
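A minimal sketch of reading such a value (assuming it is stored as a
REG_DWORD; the value type used by the real code is an assumption here):
```c
#include <windows.h>
#include <stdbool.h>

/* Read HKEY_CURRENT_USER\Software\Wireshark\ConsoleOpen as a DWORD.
 * Returns false if the value is missing or has the wrong type. */
static bool read_console_open(DWORD *result)
{
    DWORD data = 0;
    DWORD size = sizeof(data);

    if (RegGetValueW(HKEY_CURRENT_USER, L"Software\\Wireshark",
                     L"ConsoleOpen", RRF_RT_REG_DWORD,
                     NULL, &data, &size) != ERROR_SUCCESS)
        return false;
    *result = data;
    return true;
}
```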
The code to also store the preference in the preferences file was
kept, for backward compatibility and because it does not conflict
with using the registry concurrently.
The elimination of the prefs dependency also allows moving the Windows
console logic to wsutil and adding the functionality to wslog directly,
thereby eliminating the superfluous Wireshark/Logray custom log handler.
To be able to read the ws_log_console_open global variable from
libwireshark it becomes necessary to add a new export macro,
WSUTIL_EXPORT.
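For illustration, a typical shape for such an export macro (a
hypothetical sketch; the actual WSUTIL_EXPORT definition may differ in
its guards and attributes):
```c
/* Hypothetical sketch of an export macro for a shared library. */
#ifdef _WIN32
  #ifdef BUILD_WSUTIL                 /* assumed guard when building the DLL */
    #define WSUTIL_EXPORT __declspec(dllexport)
  #else
    #define WSUTIL_EXPORT __declspec(dllimport)
  #endif
#else
  #define WSUTIL_EXPORT __attribute__((visibility("default")))
#endif

/* Example use: make a wsutil global readable from other libraries
 * (the int type here is illustrative). */
WSUTIL_EXPORT extern int ws_log_console_open;
```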
We amend the :<numeric> pattern to not eat the leading colon. Because
the colon can be part of the value (with IPv6 addresses, for example)
we want to avoid doing that.
IPv6 addresses are covered by their own rules, but this removes the
need to handle special cases in the future and avoids surprises.
For this reason the colon-prefix syntax is already explicitly defined
to work only for byte arrays, and there is currently no universal
syntax for all literal values or even all numbers.
Other numbers can keep using the lexical type "unparsed".
```
run/dftest "_ws.ftypes.uint8 == :fd"
Filter: _ws.ftypes.uint8 == :fd
dftest: ":fd" is not a valid number.
_ws.ftypes.uint8 == :fd
^~~
run/dftest "_ws.ftypes.uint8 == fd"
Filter: _ws.ftypes.uint8 == fd
dftest: "fd" is not a valid number.
_ws.ftypes.uint8 == fd
^~
run/dftest "_ws.ftypes.uint8 == 0xfd"
Filter: _ws.ftypes.uint8 == 0xfd
Syntax tree:
0 TEST_ANY_EQ:
1 FIELD(_ws.ftypes.uint8 <FT_UINT8>)
1 FVALUE(253 <FT_UINT8>)
Instructions:
00000 READ_TREE _ws.ftypes.uint8 <FT_UINT8> -> reg#0
00001 IF_FALSE_GOTO 3
00002 ANY_EQ reg#0 == 253 <FT_UINT8>
00003 RETURN
run/dftest "_ws.ftypes.bytes == fd"
Filter: _ws.ftypes.bytes == fd
Syntax tree:
0 TEST_ANY_EQ:
1 FIELD(_ws.ftypes.bytes <FT_BYTES>)
1 FVALUE(fd <FT_BYTES>)
Instructions:
00000 READ_TREE _ws.ftypes.bytes <FT_BYTES> -> reg#0
00001 IF_FALSE_GOTO 3
00002 ANY_EQ reg#0 == fd <FT_BYTES>
00003 RETURN
run/dftest "_ws.ftypes.bytes == :fd"
Filter: _ws.ftypes.bytes == :fd
Syntax tree:
0 TEST_ANY_EQ:
1 FIELD(_ws.ftypes.bytes <FT_BYTES>)
1 FVALUE(fd <FT_BYTES>)
Instructions:
00000 READ_TREE _ws.ftypes.bytes <FT_BYTES> -> reg#0
00001 IF_FALSE_GOTO 3
00002 ANY_EQ reg#0 == fd <FT_BYTES>
00003 RETURN
```
The <...> syntax for literals, intended to be as generic as
possible, unintentionally introduced an ambiguity with the
relational expression "a < b or a > c".
Literals are values like numbers, bytes, IPv6 addresses or, one
could imagine, UNC paths for example, if an FT_UNC type were to
be added in the future.
We could use a new unique symbol like @...@, but the <...> syntax is
very recent and may not be necessary given ":xxx", so just remove it.
A byte array can be declared explicitly by prefixing it with a colon.
That is not as generic, but the main ambiguity this new syntax
attempted to solve was bytes vs. protocol names. We don't want to
introduce a new reserved symbol for now, until other requirements, if
any, become clearer.
Fixes #18418.
When reassemble_ooo is set, we can do a better job of determining
whether a segment is a retransmission from the reassembler's
perspective, which is different from what TCP sequence analysis
determines, namely retransmission from the sender's perspective.
This also gives us a good way of dealing with retransmissions that
carry additional data.
This only works when reassembling out of order is set. Without it,
we fall back to the old method of detecting retransmissions, which
has a harder time with the edge cases.
Fix #17406, fix #15993, fix #13388, fix #13523.
Modernize the handling of experimental TCP options based on
RFC 6994. In particular, use the ExID instead of the magic number
(which, in the context of RFC 6994, is the last two bytes of a
32-bit ExID) and add a description of the ExID based on the
current state of the IANA registry.
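A minimal standalone sketch of the ExID layout described above (not
the dissector's actual code); per RFC 6994 the option data of a shared
experimental option begins with a 16-bit ExID, optionally extended to
32 bits:
```c
#include <stdint.h>
#include <stddef.h>

/* Extract the ExID from the data of a TCP experimental option
 * (kind 253/254). The first two bytes always identify the experiment;
 * a 32-bit ExID carries two more bytes, which older code treated as a
 * separate "magic" value. */
static uint32_t tcp_exp_option_exid(const uint8_t *data, size_t len,
                                    int exid_is_32bit)
{
    if (len < 2)
        return 0;                                 /* malformed option */
    if (exid_is_32bit && len >= 4)
        return ((uint32_t)data[0] << 24) | ((uint32_t)data[1] << 16) |
               ((uint32_t)data[2] << 8)  | data[3];
    return ((uint32_t)data[0] << 8) | data[1];
}
```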
Field blocks (carried in HEADERS, PUSH_PROMISE, and CONTINUATION
frames) are compressed by HPACK. Send them to the follow tap only
after decompression. Update the tests to match the new output.
Ping #18239 (There's still the case of gzip and brotli compressed
DATA frames to handle).
Implement out of order buffering and desegmentation for QUIC
CRYPTO frames. Particularly useful for Chrome's "Chaos Protection"
that intentionally introduces them, but handles out of order
CRYPTO frames in different UDP payloads as well. (Buffering
packets at a higher encryption level until the out of order
lower level frames have arrived is a different issue.)
Adds a preference, which defaults to on, since turning it off is not
very useful when out-of-order frames are present.
Fix #17732. Fix #18215.
Allow checking if a slice exists. The result is true if the
slice has length greater than zero.
The len() function is implemented as a DFVM instruction instead.
The semantics are the same.
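For example, the hypothetical filters
tcp.payload[20:]
and
len(tcp.payload) > 20
both match only packets whose TCP payload is longer than 20 bytes; the
first relies on the new "slice exists" semantics.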
This removes unparsed name resolution during the semantic
check because it feels like a hack to work around limitations
in the language syntax that should be solved at the lexical
level instead.
We were interpreting unparsed differently on the LHS and RHS.
Now an unparsed value is always a field if it matches a
registered field name (this matches the implementation in 3.6
and before).
This requires tightening the allowed filter names for protocols a bit
to avoid some common and potentially weird conflicting cases.
Incidentally this extends the set grammar to accept all entities.
That is experimental and may be reverted in the future.
This adds support for using the layers filter
with field references.
Before:
$ dftest 'ip.src != ${ip.src#2}'
dftest: invalid character in macro name
After:
$ dftest 'ip.src != ${ip.src#2}'
Filter: ip.src != ${ip.src#2}
Syntax tree:
0 TEST_ALL_NE:
1 FIELD(ip.src <FT_IPv4>)
1 REFERENCE(ip.src#[2:1] <FT_IPv4>)
Instructions:
00000 READ_TREE ip.src <FT_IPv4> -> reg#0
00001 IF_FALSE_GOTO 5
00002 READ_REFERENCE_R ${ip.src <FT_IPv4>} #[2:1] -> reg#1
00003 IF_FALSE_GOTO 5
00004 ALL_NE reg#0 != reg#1
00005 RETURN
This requires adding another level of complexity to references.
When loading references we need to copy the 'proto_layer_num'
and add the logic to filter on that.
The "layer" sttype is removed and replace by a new
field sttype with support for a range. This is a nice
cleanup for the semantic check and general simplification.
The grammar is better too with this design.
Range sttype is renamed to slice for clarity.
Use "True" or "TRUE" instead of "true" and remove case insensivity.
Same for false. This should serve to differentiate booleans a bit
more from protocol names, which should be using lower-case.
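For example, a hypothetical filter matching a boolean field:
tcp.flags.syn == True
(or TRUE), while the lower-case spelling is no longer accepted as a
boolean constant.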
Add the de facto standard Lua regex API to Wireshark. Upstream
code is copied verbatim and the module opened in the "rex" table.
This is just a user convenience and a developer quality-of-life
improvement over the GRegex Lua API, because it has always been
possible to load lrexlib-pcre2 as a Lua module from Wireshark.
This code has been unmaintained and does not pass the lrexlib test
suite. GRegex itself has been obsolescent for some time, although GNOME
has recently restarted trying to move it to PCRE2.
Remove it in preparation for a move to lrexlib-pcre2.
When processing TCP segments out of order, it is possible to get new
segments that fill a sequence gap so that at least one PDU can be
dissected, while additional PDUs (built from the now contiguous stream
bytes) still need more data. We can only determine this after first
passing the reassembled segments to the subdissector.
To keep dissection and layer numbers consistent between passes,
split the multisegment PDU, keeping the already dissected PDU(s) in
the current reassembly and creating a new MSP for the parts not yet
dissected.
Update the dissection tests to enable the currently skipped test that
requires MSP splitting and remove test_tcp_out_of_order_twopass_with_bug.
Field infos have a length property that was not stored with the
field value, so when using a negative index the end was computed
from the captured length of the frame tvbuff, leading to incorrect
results. The documentation in wireshark-filter(5) describes how
this was supposed to work, but as far as I can tell it never worked
properly.
We now store the length and use that (when it is different from -1)
to locate the end of the protocol data in the tvbuff. An extra wrinkle
is that sometimes the length is set after the field value is created.
This is the most common case as the majority of protocols have a
variable length and dissection generally proceeds with a TVB subset from
the current layer (with offset zero) through all remaining layers to the
end of the captured length. For that reason we must use an expedient to allow
changing the protocol length of an existing protocol fvalue, whenever
proto_item_set_len() is called.
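A minimal sketch of the common pattern referred to above (hypothetical
dissector, names invented): the protocol item is added with length -1
and its real length is set only after dissection, which is why the
stored fvalue length must remain updatable.
```c
#include <epan/packet.h>

static int proto_foo = -1;        /* hypothetical protocol handle */

static int
dissect_foo(tvbuff_t *tvb, packet_info *pinfo _U_, proto_tree *tree, void *data _U_)
{
    int         offset = 0;
    proto_item *ti = proto_tree_add_item(tree, proto_foo, tvb, 0, -1, ENC_NA);

    /* ... add fields, advancing offset ... */

    proto_item_set_len(ti, offset);  /* length known only at the end */
    return offset;
}
```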
Fixes #17772.
Add support to display filters for matching a specific layer within a frame.
Layers are counted sequentially up the protocol stack. Each protocol
(dissector) that appears in the stack is one layer.
LINK-LAYER#1 <-> IP#1 <-> TCP#1 <-> IP#2 <-> TCP#2 <-> etc.
The syntax allows for negative indexes and ranges with the usual semantics
for slices (but note that counting starts at one):
tcp.port#[2-4] == 1024
Matches layers 2 to 4 inclusive.
Fixes #3791.
Revert to passing a syntax node from the lexical scanner to the grammar
parser. Using a union does not have a discernible advantage and requires
duplicating a lot of the properties of syntax nodes.
This removes the limitation of having only two terms in an
arithmetic expression and allows setting the precedence using
curly braces (like any basic calculator).
Our grammar currently does not allow grouping arithmetic expressions
using parentheses, because boolean expressions and arithmetic
expressions are different and parentheses are used with the former.
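A hypothetical example of the intended grouping (not taken from the
test suite):
_ws.ftypes.int32 == {2 + 3} * 4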
Rework the out-of-order dissection to store the out-of-order segments
and add them to reassemblies only after the gaps are filled.
This allows contiguous segments to be reassembled and dissected as soon
as possible, instead of having to wait for all segment gaps to be
filled. In cases where a segment has an erroneously large sequence
number, this prevents reassembly from being halted completely.
It is now guaranteed that, when the subdissector is called, the
segment from the current frame is either the first segment of the
MSP or contains bytes that were requested by the last call to the
subdissector. This makes it easier to split MSPs in a later commit.
MSPs now always have the first segment with the sequence number,
so MSP_FLAGS_MISSING_FIRST_SEGMENT and first_frame_with_seq are
obsolete and can be removed later.
This fixes a long-standing TODO in the out-of-order test in
suite_dissection.py.
Dissection is more consistent between the first pass and later
passes, though there is more to be done.
Fix parsing of some IPv6 addresses and add tests.
Also pass tokens as unparsed unless the user was specific about
the semantic type. For example the IPv4 address 1.1.1.1 is also a
valid field, but 1.1.1.1/128 is not (because of the slash). However
we choose not to enforce the distinction in the lexical scanner and pass
everything as unparsed unless the meaning is explicit in the syntax
with a leading dot, a leading colon, or angle brackets.
This restores support for filters such as "mac-lte", which was broken
in 330d408328.
This means we are not able to support arithmetic expressions with binary
minus without spaces.
$ dftest 'tcp.port == 1-2'
dftest: "1-2" is not a valid number.
To allow an arithmetic expression without spaces, such as "1+2",
we cannot let other lexical rules match the expression using "+". Because
of longest match it would become the token LITERAL or UNPARSED with the
semantic value "1+2". The same goes for all the other arithmetic operators.
So we need to remove [+-*/%] from "word chars" and add very specific
patterns (that won't mistakenly match an arithmetic expression) for
those literal or unparsed tokens we want to support using these characters.
The plus sign was not a problem, but the forward slash is used for CIDR
notation, the minus for the MAC address separator, etc.
There are still some corner cases. 11-22-33-44-55-66 is a MAC
address and not an arithmetic expression with six terms ("eleven
minus twenty-two minus..."), if we ever support more than two terms
in the grammar, which we currently don't.
We lift some patterns from the flex manual to match IPv4 and
IPv6 addresses (ugly) and add a MAC address pattern.
Other hypothetical literal lexical values using [+-*/%] are already
supported when enclosed in angle brackets, but the cases of MAC/IPv4/IPv6
are very common and moreover we need to do our utmost not to break
backward compatibility here.
Before:
$ dftest "_ws.ftypes.int32 == 1+2"
dftest: "1+2" is not a valid number.
After:
$ dftest "_ws.ftypes.int32 == 1+2"
Filter: _ws.ftypes.int32 == 1+2
Instructions:
00000 READ_TREE _ws.ftypes.int32 -> reg#0
00001 IF_FALSE_GOTO 4
00002 ADD 1 <FT_INT32> + 2 <FT_INT32> -> reg#1
00003 ANY_EQ reg#0 == reg#1
00004 RETURN
Fix a failed assertion with constant arithmetic expressions.
Because we do not parse constants at the lexical level, it is
more complicated to handle constant expressions with unparsed
values.
We need to handle missing type information gracefully for any
kind of arithmetic expression, not just unary minus.